
Table 1: Cleavage of 75 human light chains. 

Enzyme Recognition* Nch Ns Planned location of site 





Afel 


AGCgct 


0 


0 






Aflll 


Cttaag 


0 


0 


HC FR3 


5 


Age I 


Accggt 


0 


0 






AscI 


GGcgcgcc 


0 


0 


After LC 




Bglll 


Agatct 


0 


0 






BsiWI 


Cgtacg 


0 


0 




10 


BspDI 


ATcgat 


0 


0 




BssHII 


Gcgcgc 


0 


0 






BstBI 


TTcgaa 


0 


0 






Drain 


CACNNNgtg 


0 


0 






EagI 


Cggccg 


0 


0 






Fsel 


GGCCGGcc 


0 


0 




Fspl 


TGCgca 


0 


0 






Hpal 


GTTaac 


0 


0 






Mfel 


Caattg 


0 


0 


HC FR1 




Mlul 


Acgcgt 


0 


0 




20 


NcoX 


Ccatgg 


0 


0 


Heavy chain signal 


Nhel 


Gctagc 


0 


0 


HC/anchor linker 




NotI 


GCggccgc 


0 


0 


In linker after HC 




Nrul 


TCGcga 


0 


0 






Pad 


TTAATtaa 


0 


0 




□ 


Pmel 


GTTTaaac 


0 


0 






Pmll 


CACgtg 


0 


0 






Pvul 


CGATcg 


0 


0 




3 . 3 

- y> 


SacII 


CCGCgg 


0 


0 






Sail 


Gtcgac 


0 


0 




Sfil 


GGCCNNNNnggcc 


0 


0 


Heavy Chain signal 




Sgfl 


GCGATcgc 


0 


0 






SnaBI 


TACgta 


0 


0 




StuI 


AGGcct 


0 


0 






Xbal 


Tctaga 


0 


0 


HC FR3 




Aatll 


GACGTc 


1 


1 




S3 J=5 


Acll 


AAcgtt 


1 


1 






Asel 


ATtaat 


1 


1 






BsmI 


GAATGCN 


1 


1 






BspEI 


Tccgga 


1 


1 


HC FR1 


No 


BstXI 


CCANNNNNntgg 


1 


1 


HC FR2 


DrdI 


GACNNNNnngtc 


1 


1 






Hindlll 


Aagctt 


1 


1 






Pcil 


Acatgt 


1 


1 






Sapl 


gaagagc 


1 


1 




45 


Seal 


AGTact 


1 


1 




SexAI 


Accwggt 


1 


1 






Spel 


Actagt 


1 


1 






Tlil 


Ctcgag 


1 


1 






Xhol 


Ctcgag 


1 


1 




50 


Bcgl 


egannnnnntge 


2 


2 




Blpl 


GCtnagc 


2 


2 






BssSI 


Ctcgtg 


2 


2 






BstAPI 


GCANNNNntgc 


2 


2 






Espl 


GCtnagc 


2 


2 




55 


KasI 


Ggcgcc 


2 


2 




PflMI 


CCANNNNntgg 


2 


2 






XmnI 


GAANNnnttc 


2 


2 
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Express Mail Label j 
No. EI125454892US \ 





ApaLI 


Gtgcac 


3 


3 




Nael 


GCCggc 


3 


3 




NgoMI 


Gccggc 


3 


3 




PvuII 


CAGctg 


3 


3 


5 


RsrII 


CGgwccg 


3 


3 




BsrBI 


GAGcgg 


4 


4 




BsrDI 


GCAATGNNn 


4 


4 




BstZ17I 


GTAtac 


4 


4 


10 


. EcoRI 


Gaattc 


4 


4 


SphI 


GCATGc 


4 


4 




Sspl 


AATatt 


4 


4 




AccI 


GTmkac 


5 


5 




Bell 


Tgatca 


5 


5 


15 


BsmBI 


Nnnnnngagacg 


5 


5 


BsrGI 


Tgtaca 


5 


5 




Dral 


TTTaaa 


6 


6 




Ndel 


CAtatg 


6 


6 




Swal 


ATTTaaat 


6 


6 


20 


BamHI 


Ggatcc 


7 


7 


Sad 


GAGCTc 


7 


7 




BciVI 


GTATCCNNNNNN 


8 


8 




BsaBI 


GATNNnnatc 


8 


8 




Nsil 


ATGCAt 


8 


8 




Bspl20l 


Gggccc 


9 


9 




Apal 


GGGCCc 


9 


9 


rni 

X. US 


PspOOMI 


Gggccc 


9 


9 


w 


BspHI 


Tcatga 


9 


11 




EcoRV 


GATatc 


9 


9 




Ahdl 


GACNNNnngtc 


11 


11 


Bbsl 


GAAGAC 


11 


14 


E3 


Psil 


TTAtaa 


12 


12 




Bsal 


GGTCTCNnnnn 


13 


15 


? 


Xmal 


Cccggg 


13 


14 


Or 


Aval 


Cycgrg 


14 


16 




Bgll 


GCCNNNNnggc 


14 


17 


^ . 


AlwNI 


CAGNNNctg 


16 


16 




BspMI 


ACCTGC 


17 


19 




Xcml 


CCANNNNNnnnntgg 


17 


26 




BstEII 


Ggtnacc 


19 


22 




Sse8387I 


CCTGCAgg 


20 


20 




Avrll 


Cctagg 


22 


22 




Hindi 


GTYrac 


22 


22 




Bsgl 


GTGCAG 


27 


29 




MscI 


TGGcca 


30 


34 


45 


BseRI 


NNnnnnnnnnctcctc 


32 


35 




Bsu36I 


CCtnagg 


35 


37 




PstI 


CTGCAg 


35 


40 




Ecil 


nnnnnnnnntccgcc 


38 


40 


50 


PpuMI 


RGgwccy 


41 


50 


Styl 


Ccwwgg 


44 


73 




EcoO109l 


RGgnccy 


46 


70 




Acc65I 


Ggtacc 


50 


51 




Kpnl 


GGTACc 


50 


51 


55 


Bpml 


ctccag 


53 


82 


Avail 


Ggwcc 


71 


124 



LC signal seq 



CHI 



* cleavage occurs in the top strand after the last upper-case base. For REs 
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that cut palindromic sequences, the lower strand is cut at the symmetrical 
site. 

Table 2: Cleavage of 79 human heavy chains 



10 



15 



20 



P 
\«* 

u 



130 

X S3, 



l 35 

I-* 

^ . H 



45 



50 



55 





Recocrnition 


Ncn 


Ns 


B f o T 


7\ 

AbLgCu 


0 


0 


Aril I 


Cttaag 


0 


0 


AscI 


GGcgcgcc 


0 


0 


BsiWI 


Cgtacg 


0 


0 


BspDI 


ATcgat 


0 


0 


BssHII 


Gcgcgc 


0 


0 


Fsel 


GGCCGGcc 


0 


0 


Hpal 


GTTaac 


0 


0 


Nhel 


Gctagc 


0 


0 


Not I 


GCggccgc 


0 


0 


Nrul 


TCGcga 


0 


0 


Nsil 


ATGCAt 


0 


0 


Pad 


TTAATtaa 


0 


0 


Pcil 


Acatgt 


0 


0 


Pmel 


GTTTaaac 


0 


0 


Pvul 


CGATcg 


0 


0 


RsrII 


CGgwccg 


0 


0 


SapT 


gaagagc 


0 


0 


Sfil 


GGCCNNNNnggcc 


0 


0 


Sgfl 


GCGATcgc 


0 


0 


Swal 


ATTTaaat 


0 


0 


Acll 


AAcgtt 


1 


1 


Age I 


Accggt 


1 


1 


Asel 


ATtaat 


1 


1 


Avrll 


Cctagg 


1 


1 


BsmI 


GAATGCN 


1 


1 


BsrBI 


GAGcgg 


1 


1 


BsrDI 


GCAATGNNn 


1 


1 


Dral 


TTTaaa 


1 


1 


Fspl 


TGCgca 


1 


1 


Hindlll 


Aagctt 


1 


1 


Mfel 


Caattg 


1 


1 


Nael 


GCCggc 


1 


1 


NgoMI 


Gccggc 


1 


1 


Spel 


Actagt 


1 


1 


Acc65I 


Ggtacc 


2 


2 


BstBI 


TTcgaa 


2 


2 


Kpnl 


GGTACc 


2 


2 


Mlul 


Acgcgt 


2 


2 


Ncol 


Ccatgg 


2 


2 


Ndel 


CAtatg 


2 


2 


Pmll 


CACgtg 


2 


2 


Xcml 


CCANNNNNnnnntgg 


2 


2 


Bcgl 


egannnnnntge 


3 


3 


Bell 


Tgatca 


3 


3 


Bgll 


GCCNNNNnggc 


3 


3 


BsaBI 


GATNNnnatc 


3 


3 


BsrGI 


Tgtaca 


3 


3 


SnaBI 


TACgta 


3 


3 


Sse8387l 


CCTGCAgg 


3 


3 



Planned location of site 



HC FR3 
After LC 



HC Linker 

In linker, HC/anchor 



HC signal seq 



In HC signal seq 
HC FR4 



[18] 



ApaLI Gtgcac 4 

BspHI Tcatga 4 

BssSI Ctcgtg 4 

Psil TTAtaa 4 

5 SphI GCATGc 4 

Ahdl GACNNNnngtc 5 

BspEI Tccgga 5 

MscI TGGcca 5 

SacI GAGCTc 5 

10 Seal AGTact 5 

SexAI Accwggt 5 

Sspl AATatt 5 

Tlil Ctcgag 5 

Xhol Ctcgag 5 

15 Bbsl GAAGAC 7 

BstAPI GCANNNNntgc 7 

BstZ17I GTAtac 7 

EcoRV GATatc 7 

EcoRI Gaattc 8 

20 BlpI GCtnagc 9 

Bsu36l CCtnagg 9 

Drain CACNNNgtg 9 

Espl GCtnagc 9 

StuI AGGcct 9 

25 Xbal Tctaga 9 

I % Bspl 201 Gggccc 1 0 

t% Apal GGGCCc 10 

X% PspOOMI Gggccc 10 

^ BciVI GTATCCNNNNNN 11 

'3D Sail Gtcgac 11 

C3 DrdI GACNNNNnngtc 12 

m KasI Ggcgcc 12 

s Xmal Cccggg 12 

Bglll Agatct 14 

S§ Hindi GTYrac 16 

BamHI Ggatcc 17 

r* PflMI CCANNNNntgg 17 

% 4 BsmBI Nnnnnngagacg 18 

p BstXI CCANNNNNntgg 18 

: 0 XmnI GAANNnnttc 18 

SacII CCGCgg 19 

PstI CTGCAg 20 

PvuII CAGctg 20 

Aval Cycgrg 21 

45 EagI Cggccg 21 

Aatll GACGTc 22 

BspMI ACCTGC 27 

AccI GTmkac 30 

Sty I Ccwwgg 36 

50 AlwNI CAGNNNctg 38 

Bsal GGTCTCNnnnn 38 

PpuMI RGgwccy 43 

Bsgl GTGCAG 44 

BseRI NNnnnnnnnnctcctc 48 

55 Ecil nnnnnnnnntccgcc 52 

BstEII Ggtnacc 54 

EcoO109l RGgnccy 54 



4 LC Signal/FRl 

4 
4 
5 
4 
5 

5 HC FR1 

5 
5 
5 
6 
5 
5 
5 
8 
8 
7 
7 
8 
9 
9 
9 
9 
13 

9 HC FR3 
11 CHI 
11 CHI 

11 
11 
12 
12 
12 
14 
14 
18 
17 
18 
21 

19 HC FR2 

18 
19 
24 
22 
24 
22 
22 
33 
43 
49 
44 
44 
46 
54 
60 
57 

61 HC Fr4, 47/79 have one 

86 
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Bpml • et.ccag 
Aval! Ggwc'c 



60 121 
71 140 



C3 

IS 

w 

N- 

la 

C3 

en 



Z * >. 



Table 5 (araowdod) : Use of Fokl as "Universal Restriction Enzyme" 



Fokl - for dsDNA, | represents sites of cleavage 

sites of cleavage 
5 1 -cac GGATG tq--nnnnnnn I nnnnnnn-3 1 ( SEP ID N0:15) 
3 1 -gtgCCTACac — nnnnnnnnnnn | nnn-5 1 ( SEQ ID NO: 16) 
RECOG 

NITion of FoJcI 

Case I 

5'-. . .gtg I tatt-actgtgc. .Substrate -3 1 (SEQ ID NO: 17) 

3 1 -cac- ataa I tgacacg— [ 

gtGTAGGcacX 
5'- caCATCCgtg/ (SEQ ID NO: 18) 

Case II 

5'-. . -gtgtattlagac-tgc. .Substrate -3 ! (SEQ ID N0:19) 

r -cacataa -tctg | acg-5 1 
/gtgCCTACac 

\cacGGATGtg-3 ' (SEQ ID NO:20) 

Case III (Case I rotated 180 degrees) 

/gtgCCTACac-5 f 
\cacGGATGt£— | 

gtgtctt lacag-tcc-3 1 Adapter (SEQ ID NO: 21) 
3 1 -. . .cacagaa-tgtc | agg. .substrate -5' (SEQ ID NO:22) 

Case IV (Case II rotated 180 degrees) 




3'- gtGTAGGcac\ (SEQ ID NO: 23) 
r—caCATCCgtg/ 
5 1 -gag | tctc-actgaac 
Substrate 3 1 -. . . ctc-agag | tgactcg . . .-5' (SEQ ID NO: 24) 

Improved Fokl adapters 



Fokl - for dsDNA, | represents sites of cleavage 
Case I 

Stem 11, loop 5, stem 11, recognition 17 

5 1 - . . . catgtg | tatt-actgtgc. . Substrate .... -3 1 
3 1 -qtacac- ataa I tgacacg— | r T— | 

gtGTAGGcacG T 
5 1 - caCATCCgtgc C 
Ltt-' 



Case II 

Stem 10, loop 5, stem 10, recognition 18 

5 • - . . . gtgtatt | agac-tgctgcc . . Substrate .... -3 ' 
. r T l w |— cacataa-tctgl acgacgg-5 1 

fl T gtgCCTACac 

C cacGGATGtg-3 1 



|J Case III (Case I rotated 180 degrees) 

Stem 11, loop 5, stem 11, recognition 20 



-.A 



ll r T n 

frz T TgtgCCTACac-5 1 

v= " G AcacGGATGtq— } 

JL L TT J gtatctt I acaq-tccattcta-3 1 Adapter 

^ 3 1 - . . . cacagaa-tgtc | aggtaagac . . substrate , 



U Case IV (Case II rotated 180 degrees) 

Stem 11, loop 4, stem 11, recognition 17 

3 f - gtGTAGGcacc T 
[— caCATCCgtgg T 
5 1 -atcgag I tctc-actgagc Lt- 1 
Substrate 3 1 - . . . tagctc-agag | tgactcg. . . -5 1 



BseRI 

I sites of cleavage 
5 1 -cacGAGGAGnnnnnnnnnn | nnnnn-3 1 
3 1 -gtgctcctcnnnnnnnn | nnnnnnn-5 1 
RECOG 

NITion of BseRI 

Stem 11, loop 5, stem 11, recognition 19 

3 ' - gaacat | cg-ttaagccagta 5 1 

fT-T n cttgta-gc I aattcggtcat-3 1 

C GCTGAGGAGTC-- 1 

T cgactcctcag-5 1 An adapter for BseRI to cleave the substrate above. 
L T I 



rn. 



Table 8: Matches to URE 


FR3 adapters in 79 human HC. 




A. List of 


Heavy-chains 


genes sampled 






AF008566 




afl03343 




HSA235676 


HSU92452 


UC7 Q-3Q Cf| 


AF035043 




AF103367 




HSA235675 


HSU94412 




AF103026 




AF103368 




HSA235674 


HSU94415 




afl03033 




AF103369 




HSA235673 


HSU94416 




AF103061 




AF103370 




HSA240559 


HSU94417 


GOO"] A ^ 


Afl03072 




afl03371 




HSCB201 


HSU94418 


jO^ / D4 


afl03078 




AF103372 




HSIGGVHC 


HSU96389 




AF103099 




AF158381 




HSU44791 


HSU96391 


or\Cj v n j u _? 


AF103102 




E05213 




HSU44793 


HSU96392 


orilJEj J. \J V n 


AF103103 




E05886 




HSU82771 


HSU96395 


Orifl^ luvn 


AF103174 




E05887 




HSU82949 


HSZ93849 


O UnJluvn 


AF103186 




HSA235661 




HSU82950 


HSZ93850 


jiuvni i u 


afl03187 




HSA235664 




HSU82952 


HSZ93851 


CHIKA TtZ\TU 

ouiai luvn 


AF103195 




HSA235660 




HSU82961 


HSZ93853 




afl03277 




HSA235659 




HSU86522 


HSZ93855 




afl03286 




HSA235678 




HSU86523 


HSZ93857 




AF103309 




HSA235677 










Table 8 B. Testing all distinct GLGs from bases 89.1 to 93.2 of the heavy variable domain 


Id 


Nb 


0 1 


2 


3 4 




SEQ ID NO: 


1 


38 


15 11 


10 


0 2 Seql 


gtgtattactgtgc 


25 


2 


19 


7 6 


4 


2 0 Seq2 


gtAtattactgtgc 


26 


3 


1 


0 0 


1 


0 0 Seq3 


gtgtattactgtAA 


27 


4 


7 


1 5 


1 


0 0 Seq4 


gtgtattactgtAc 


28 


5 


0 


0 0 


0 


0 0 Seq5 


Ttgtattactgtgc 


29 


6 


0 


0 0 


0 


0 0 Seq6 


TtgtatCactgtgc 


30 


7 


3 


1 0 


1 


1 0 Seq7 


ACAtattactgtgc 


31 


8 


2 


0 2 


0 


0 0 Seq8 


ACgtattactgtgc 


32 


9 


9 


2 2 


4 


1 0 Seq9 


ATatattactatac 


33 


Group 




26 26 


21 


4 2 






Cumulative 




26 52 


73 


77 79 







[51] 



• 



Table 8C Most important URE recognition seqs in FR3 Heavy 

1 VHSzyl GTGtattactgtgc (ON_SHC103) (SEQ ID NO: 25) 

2 VHSzy2 GTAtattactgtgc (ON_SHC323) (SEQ ID NO: 26) 

3 VHSzy4 GTGtattactgtac (ON_SHC349) (SEQ ID NO: 28) 
5 4 VHSzy9 ATGtattactgtgc (ON_SHC5a) (SEQ ID NO: 33) 

Table 8D, testing 79 human HC V genes with four probes 



10 



Number of sequences 79 

Number of bases 29143 



Number of mismatches 
Id Best 0 1 2 3 4 5 

1 39 15 11 10 1 2 0 Seql gtgtattactgtgc (SEQ ID NO:25) 

2 22 765301 Seq2 gtAtattactgtgc (SEQ ID NO: 26) 
15 3 7 151000 Seq4 gtgtattactgtAc (SEQ ID NO:28) 

4 11 244100 Sea9 ATatattactatac (SEQ ID NO:33) 
n Group 25' 26 20 5 2 

3 Cumulative 25 51 71 7 6 78 

f fj 

4& 3 One sequence has five mismatches with sequences 2, 4, and 9; it is scored as best for 2. 

IJ 

^ Id is the number of the adapter. 

Best is the number of sequence for which the identified adapter was the best available. 

The rest of the table shows how well the sequences match the adapters. For example, there are 11 

L_*. 

|3 sequences that match VHSzyl (Id=l) with 2 mismatches and are worse for all other adapters. In 
C3 this sample, 90% come within 2 bases of one of the four adapters. 



Table 130: PCR primers for amplification of human Ab genes 
(HulgMFOR) 5'-tgg aag agg cac gtt ctt ttc ttt-3 ! 

! (HulgMFOREtop) 5 1 -aaa gaa aag aac gtg cct ctt cca-3 1 = reverse complement 
(HuCkFOR) 5 f -aca etc tec cct gtt gaa get ctt-3 f 

(HuCL2FOR) 5*-tga aca ttc tgt agg ggc cac tg-3 ! 

(HuCL7FOR) 5'-aga gca ttc tgc agg ggc cac tg-3 ! 

! Kappa 

(CKForeAsc) 5 ! -acc gee tec ace ggg cgc gee tta tta aca etc tec cct gtt- 
gaa get ctt-3' 

(CL2ForeAsc) 5'-acc gee tec ace ggg cgc gee tta tta tga aca ttc tgt- 
agg ggc cac tg-3* 

(CL7ForeAsc) 5' -ace gec tec acc ggg cgc gee tta tta aga gca ttc tgc- 
agg ggc cac tg-3 f 



Table 195: Human GLG FR3 sequences 
! VH1 

! 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 



agg 


gtc 


acc 


81 


82 


82a 


gag 


ctg 


age 


93 


94 


95 


gcg 


aga 


ga 


aga 


gtc* 


acc 


gag 


ctg 


age 


gcg 


aga 


ga 


aga 


gtc 


acc 


gag 


ctg 


age 


gcg 


aga 


gg 


aga 


gtc 


acc 


gag 


ctg 


agg 


gcg 


aga 


ga 


aga 


gtc 


acc 


gag 


ctg 


age 


gca 


aca 


ga 


aga 


gtc 


acc 


gag 


ctg 


age 


gca 


aga 


ta 


aga 


gtc 


acc 


gag 


ctg 


age 


gcg 


aga 


ga ! 



atg acc agg 
82b 82c 83 
agg ctg aga 

l-02# 1 
att acc agg 
age ctg aga 

l-03# 2 
atg acc agg 
age ctg aga 

l-08# 3 
atg acc aca 
age ctg aga 

1-18# 4 
atg acc gag 
age ctg aga 

l-24# 5 
att acc agg 
age ctg aga 

l-45# 6 
atg acc agg 
age ctg aga 

l-46# 7 



gac 


acg 


tec 


84 


85 


86 


tct 


gac 


gac 


gac 


aca 


tec 


LCt 


gaa 


gac 


aac 


acc 


tec 


tct 


g a g 


gac 


gac 


aca 


tec 


+- r-> +- 


gac 


gac 


gac 


aca 


tct 




g a g 


gac 


gac 


agg 


tct 


tct 


gag 


gac 


gac 


acg 


tec 


tct 


gag 


gac 



ate 


age 


aca 


87 


88 


89 


acg 


gee 


gtg 


gcg 


age 


aca 


acg 


get 


gtg 


ata 


age 


aca 


acg 


gee 


gtg 


acg 


age 


aca 


acg 


gee 


gtg 


aca 


gac 


aca 


acg 


gee 


gtg 


atg 


age 


aca 


aca 


gee 


atg 


acg 


age 


aca 


acg 


gee 


gtg 



gee tac atg 
90 91 92 
tat tac tgt 

gee tac atg 
tat tac tgt 

gee tac atg 
tat tac tgt 

gee tac atg 
tat tac tgt 

gee tac atg 
tat tac tgt 

gee tac atg 
tat tac tgt 

gtc tac atg 
tat tac tgt 






aga 


gtc 


acc 


att acc agg 




gag 


ctg 


age 


age ctg aga 




gcg 


gca 


ga 


! l-58# 8 




aga 


gtc 


acg 


att acc gcg 


5 


gag 


ctg 


age 


age ctg aga 




gcg 


aga 


ga ! 


! l-69# 9 




aga 


gtc 


acg 


att acc gcg 




gag 


ctg 


age 


age ctg aga 




gcg 


aga 


ga ! 


! l-e# 10 


10 


aga 


gtc 


acc 


ata acc gcg 




gag 


ctg 


age 


age ctg aga 




gca 


aca 


ga ! 


! l-f# 11 




! VH2 










agg 


etc 


acc 


ate acc aag 


15 


aca 


atg 


acc 


aac atg gac 


O 


gca 


cac 


aga 


c! 2-05# 12 


agg 


etc 


acc 


ate tec aag 


£0 


acc 


atg 


acc 


aac atg gac 




gca 


egg 


ata 


c! 2-26# 13 




agg 


etc 


acc 


ate tec aag 


O 


aca 


atg 


acc 


aac atg gac 




gca 


egg 


ata 


c! 2-70# 14 


3 


! VH3 








f 1 


cga 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


age ctg aga 




gcg 


aga 


ga ! 


! 3-07# 15 


f 1 
■a sp- 

.3 E 


cga 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


agt ctg aga 




gca 


aaa 


gat 


a! 3-09#16 




cga 


ttc 


acc 


ate tec agg 




caa 


atg 


aac 


age ctg aga 




gcg 


aga 


ga ! 


! 3-ll# 17 




cga 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


age ctg aga 


35 


gca 


aga 


ga ! 


! 3-13# 18 




aga 


ttc 


acc 


ate tea aga 




caa 


atg 


aac 


age ctg aaa 




acc 


aca 


ga ! 


3-15# 19 




cga 


ttc 


acc 


ate tec aga 




gac atg tec aca age aca gec tac atg 

tec gag gac acg gee gtg tat tac tgt 

gac gaa tec acg age aca gec tac atg 

tct gag gac acg gec gtg tat tac tgt 

gac aaa tec acg age aca gee tac atg 

tct gag gac acg gee gtg tat tac tgt 

gac acg tct aca gac aca gec tac atg 

tct gag gac acg gee gtg tat tac tgt 

gac acc tec aaa aac cag gtg gtc ctt 

cct gtg gac aca gee aca tat tac tgt 

gac acc tec aaa age cag gtg gtc ctt 

cct gtg gac aca gee aca tat tac tgt 

gac acc tec aaa aac cag gtg gtc ctt 

cct gtg gac aca gee acg tat tac tgt 

gac aac gec aag aac tea ctg tat ctg 

gee gag gac acg get gtg tat tac tgt 

gac aac gee aag aac tec ctg tat ctg 

get gag gac acg gee ttg tat tac tgt 

gac aac gec aag aac tea ctg tat ctg 

gee gag gac acg gec gtg tat tac tgt 

gaa aat gee aag aac tec ttg tat ctt 

gee ggg gac acg get gtg tat tac tgt 

gat gat tea aaa aac acg ctg tat ctg 

acc gag gac aca gee gtg tat tac tgt 

gac aac gee aag aac tec ctg tat ctg 






caa 


atg 


aac 


agt ctg aga 




gcg 


aga 


ga 


! 3-20# 20 




cga 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


age ctg aga 


5 


gcg 


aga 


ga 


! 3-21# 21 




egg 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


age ctg aga 




gcg 


aaa 


ga 


! 3-23# 22 




cga 


ttc 


acc 


ate tec aga 


JO 


caa 


atg 


aac 


age ctg aga 




gcg 


aaa 


ga 


! 3-30# 23 




cga 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


age ctg aga 




gcg 


aga 


ga ! 


! 3303# 24 


15 


cga 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


age ctg aga 


o 


gcg 


aaa 


ga ! 


! 3305# 25 


m 


cga 


ttc 


acc 


ate tec aga 


u 


caa 


atg 


aac 


age ctg aga 


s m 

i ■ a 

C3 


gcg 


aga 


ga ! 


! 3-33# 26 


cga 


ttc 


acc 


ate tec aga 


3. : S 


caa 


atg 


aac 


agt ctg aga 


3 


gca 


aaa 


gat 


a! 3-43#27 


O 


cga 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


age ctg aga 




gcg 


aga 


ga ! 


! 3-48# 28 




aga 


ttc 


acc 


ate tea aga 


caa 


atg 


aac 


age ctg aaa 




act 


aga 


ga ! 


! 3-49# 29 


30 


cga 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


age ctg aga 




gcg 


aga 


ga ! 


! 3-53# 30 




aga 


ttc 


acc 


ate tec aga 




caa 


atg 


ggc 


age ctg aga 


35 


gcg 


aga 


ga ! 


! 3-64# 31 




aga 


ttc 


acc 


ate tec aga 




caa 


atg 


aac 


age ctg aga 




gcg 


aga 


ga ! 


3-66# 32 




aga 


ttc 


acc 


ate tea aga 




gee gag gac acg gec ttg tat cac tgt 

gac aac gee aag aac tea ctg tat ctg 
gee gag gac acg get gtg tat tac tgt 

gac aat tec aag aac acg ctg tat ctg 
gec gag gac acg gec gta tat tac tgt 

gac aat tec aag aac acg ctg tat ctg 
get gag gac acg get gtg tat tac tgt 

gac aat tec aag aac acg ctg tat ctg 
get gag gac acg get gtg tat tac tgt 

gac aat tec aag aac acg ctg tat ctg 
get gag gac acg get gtg tat tac tgt 

gac aat tec aag aac acg ctg tat ctg 
gee gag gac acg get gtg tat tac tgt 

gac aac age aaa aac tec ctg tat ctg 
act gag gac acc gec ttg tat tac tgt 

gac aat gee aag aac tea ctg tat ctg 
gac gag gac acg get gtg tat tac tgt 

gat ggt tec aaa age ate gee tat ctg 
acc gag gac aca gec gtg tat tac tgt 

gac aat tec aag aac acg ctg tat ctt 
gec gag gac acg gec gtg tat tac tgt 

gac aat tec aag aac acg ctg tat ctt 
get gag gac atg get gtg tat tac tgt 

gac aat tec aag aac acg ctg tat ctt 
get gag gac acg get gtg tat tac tgt 

gat gat tea aag aac tea ctg tat ctg 





caa 


atg 




get 


aga 




agg 


ttc 




caa 


atg 


5 


act 


aga 




cga 


ttc 




caa 


atg 




gca 


aga 




aga 


ttc 


10 


caa 


atg 




aag 


aaa 




! VH4 






cga 


gtc 




aag 


ctg 


15 


gcg 


aga 




cga 


gtc 


q 


aag 


ctg 




gcg 


aga 


hi 


cga 


gtt 




aag 


ctg 




gcg 


aga 




cga 


gtc 


5 


aag 


ctg 




gec 


aga 




cga 


gtt 


? ™ 


aag 


ctg 


: . * 


gec 


aga 




cga 


gtt 




aag 


ctg 


30 


gcg 


aga 




cga 


gtc 




aag 


ctg 




gcg 


aga 




cga 


gtc 


35 


aag 


ctg 




gcg 


aga 




cga 


gtc 




aag 


ctg 




gcg 


aga 



aac age ctg aaa 
ga ! 3-72# 33 
acc ate tec aga 
aac age ctg aaa 
ca ! 3-73# 34 
acc ate tec aga 
aac agt ctg aga 
ga ! 3-74# 35 
acc ate tec aga 
aac age ctg aga 
ga ! 3-d# 36 

acc ata tea gta 
age tct gtg acc 
ga ! 4-04# 37 
acc atg tea gta 
age tct gtg acc 
aa ! 4-28# 38 
acc ata tea gta 
age tct gtg act 
ga ! 4301# 39 
acc ata tea gta 
age tct gtg acc 
ga ! 4302# 40 
acc ata tea gta 
age tct gtg act 
ga ! 4304# 41 
acc ata tea gta 
age tct gtg act 
ga ! 4-31# 42 
acc ata tea gta 
age tct gtg acc 
ga ! 4-34# 43 
acc ata tec gta 
age tct gtg acc 
ca ! 4-39# 44 
acc ata tea gta 
age tct gtg acc 
ga ! 4-59# 45 



acc gag gac acg 

gat gat tea aag 
acc gag gac acg 

gac aac gee aag 
gee gag gac acg 

gac aat tec aag 
get gag gac acg 

gac aag tec aag 
gec gcg gac acg 

gac acg tec aag 
gee gtg gac acg 

gac acg tct aag 
gec gcg gac acg 

gac agg tec aag 
gee gcg gac acg 

gac acg tec aag 
gec gca gac acg 

gac acg tct aag 
gec gcg gac acg 

gac acg tec aag 
gec gcg gac acg 

gac acg tec aag 
gee gca gac acg 

gac acg tec aag 
get gcg gac acg 



gee gtg tat tac tgt 

aac acg gcg tat ctg 
gee gtg tat tac tgt 

aac acg ctg tat ctg 
get gtg tat tac tgt 

aac acg ctg cat ctt 
get gtg tat tac tgt 

aac cag ttc tec ctg 
gec gtg tat tac tgt 

aac cag ttc tec ctg 
gee gtg tat tac tgt 

aac cag ttc tec ctg 
gee gtg tat tac tgt 

aac cag ttc tec ctg 
gec gtg tat tac tgt 

aac cag ttc tec ctg 
gec gtg tat tac tgt 

aac cag ttc tec ctg 
gee gtg tat tac tgt 

aac cag ttc tec ctg 
get gtg tat tac tgt 

aac cag ttc tec ctg 
get gtg tat tac tgt 

aac cag ttc tec ctg 
gec gtg tat tac tgt 



cga gtc acc ata tea gta gac 
aag ctg age tct gtg acc get 
gcg aga ga ! 4-63J 46 
cga gtc acc ata tea gta gac 
aag ctg age tct gtg acc gec 
gcg aga ga ! 4-b# 47 
! VH5 

cag gtc acc ate tea gee gac 
cag tgg age age ctg aag gee 
gcg aga ca ! 5-51# 48 
cac gtc acc ate tea get gac 
cag tgg age age ctg aag gee 
gcg aga ! 5-a# 49 
! VH6 

cga ata acc ate aac cca gac 
cag ctg aac tct gtg act ccc 
gca aga ga ! 6-l# 50 
! VH7 

egg ttt gtc ttc tec ttg gac 
cag ate tgc age eta aag get 
gcg aga ga ! 74 . 1# 51 



acg tec aag aac cag ttc tec ctg 

gcg gac acg gee gtg tat tac tgt 

acg tec aag aac cag ttc tec ctg 

gca gac acg gee gtg tat tac tgt 

aag tec ate age acc gee tac ctg 

teg gac acc gee atg tat tac tgt 

aag tec ate age act gee tac ctg 

teg gac acc gee atg tat tac tgt 

aca tec aag aac cag ttc tec ctg 

gag gac acg get gtg tat tac tgt 

acc tct gtc age acg gca tat ctg 

gag gac act gee gtg tat tac tgt 



Table 250: REdaptors, Extenders, and Bridges used for Cleavage and Captu 
Human Heavy Chains in FR3. 

A: HpyCH4V Probes of actual human HC genes 

!HpyCH4V in FR3 of human HC, bases 35-56; only those with TGca site 
TGca; 10, 

RE recognition: tgca of length 4 is expected at 10 

1 6-1 agttctccctgcagctgaactc 



2 
3 
4 
5 
6 
7 
8 
9 



3-11, 3-07, 3-21, 3-72, 3-48 cactgtatctgcaaatgaacag 

3-09,3-43, 3-20 ccctgtatctgcaaatgaacag 

5-51 ccgcctacctgcagtggagcag 

3-15, 3-30, 3-30 .5, 3-30.3, 3-74,3-23, 3-33 cgctgtatctgcaaatgaacag 

7-4.1 cggcatatctgcagatctgcag 

3-73 cggcgtatctgcaaatgaacag 

5-a ctgcctacctgcagtggagcag 

3-49 tcgcctatctgcaaatgaacag 



B : HpyCH4V REdaptors, Extenders, and Bridges 
B.1 REdaptors 

Cutting HC lower strand: 

TmKeller for 100 mM NaCl, zero formamide 
Edapters for cleavage 



(ON_HCFR36-l) 
(0N_HCFR36-1A) 
(0N_HCFR36-1B) ■ 
(ON_HCFR33-15) 
(ON_HCFR33-15A) 
(ON_HCFR33-15B) 
(ON_HCFR33-ll) 
(ON HCFR35-51) 



5 ' -agttctcccTGCAgctgaactc-3 ■ 
5 1 -ttctcccTGCAgctgaactc-3 1 
5 1 -ttctcccTGCAgctgaac-3 1 

5 ■ -cgctgtatcTGCAaatgaacag-3 * 
5 ■ -ctgtatcTGCAaatgaacag-3 1 
5 1 -ctgtatcTGCAaatgaac-3 1 

5 1 -cactgtatcTGCAaatgaacag-3 1 

5 1 -ccgcctaccTGCAgtggagcag-3 1 



m W 

68. 
62. 
56. 
64. 
56. 
50. 
62. 
74. 



rn K 

64.5 
62.5 
59.9 
60.8 
56.3 
53.1 
58.9 
70.1 



B2 Segment of synthetic 3-23 gene into which captured CDR3 is to be cloned 

Xbal . . . 

D323* cgCttcacTaag tcT aqa gac aaC tcT aag aaT acT etc taC 



scab designed gene 3-23 gene. 

HpyCH4V 

Aflll... 
Ttg caG atg aac ag e TtA aa G 



B3 Extender and Bridges 

! Extender (bottom strand) 



(ON_HCHpyEx01 ) 5 ' - c AAgTAgAgAgTATTcTTAgAgTTgTcTcTAgAcTTAgTgAAgcg - 3 
ON_HCHpyEx01 is the reverse complement of 

5 1 -cgCttcacTaag tcT aqa gac aaC tcT aag aaT acT etc taC Ttg -3' 



Bridges (top strand, 9-base overlap) : 



(ON_HCHpyBr016-l) 5 1 -cgCttcacTaag tcT aqa gac aaC tcT aag- 

aaT acT etc taC Ttg CAgctgaac-3 1 (3 1 -term C is blocked} 

i 

! 3-15 et al. + 3-11 

(ON_HCHpyBr023-15) 5 1 -cgCttcacTaag tcT aqa gac aaC tcT aag- 

aaT acT etc taC Ttg CAaatgaac-3' {3 1 -term C is blocked} 

j 

! 5-51 

(ON_HCHpyBr045-51) 5 1 -cgCttcacTaag tcT aqa gac aaC tcT aag- 

aaT acT etc taC Ttg CAgtggagc-3 1 {3 1 -term C is blocked} 

j 

! PCR primer (top strand) 
i 

(ONJiCHpyPCR) 5 1 -cgCttcacTaag tcT aqa gac-3' 

; 

C: BlpI Probes from human HC GLGs 

1 1-58, 1-03, 1-08, 1-69, 1-24 , 1-45 , 1-4 6, 1-f , 1-e acatggaGCTGAGCagcctgag 

2 1-02 acatggaGCTGAGCaggctgag 

3 1-18 acatggagctgaggagcctgag 

4 5-51, 5-a acctgcagtggagcagcctgaa 

5 3-15 , 3-73,3-49, 3-72 atctgcaaatgaacagcctgaa 

6 3303,3-33,3-07,3-11,3-30,3-21,3-23,3305,3-48 atctgcaaatgaacagcctgag 

7 3-20, 3-74, 3-09, 3-43 atctgcaaatgaacagtctgag 

8 74.1 atctgcagatctgcagcctaaa 

9 3-66, 3-13, 3-53, 3-d atcttcaaatgaacagcctgag 

3-64 atcttcaaatgggcagcctgag 

11 4301, 4-28, 4302, 4-04, 4304, 4-31, 4-34, 4-39, 4-59, 4-61, 4-b ccctgaaGCTGAGCtctgtgac 

12 6-1 ccctgcagctgaactctgtgac 

13 2-70, 2-05 tccttacaatgaccaacatgga 

14 2-2 6 tccttaccatgaccaacatgga 
D : BlpI REdaptors, Extenders, and Bridges 

D.l REdaptors 

T m w T m K 

(BlpF3HCl-58) 5'-ac atg gaG CTG AGC age ctg ag-3 1 70 66.4 

(BlpF3HC6-l) 5 f -cc ctg aag ctg age tct gtg ac-3* 70 66.4 

! BlpF3HC6-l matches 4-30.1, not 6-1. 



10 



D2 Segment of synthetic 3-23 gene into which captured CDR3 is to be cloned 



BlpI 

Xbal... 

D323+ cgCttcacTaag TCT AGA gac aaC tcT aag aaT acT etc taC Ttg caG atg aac 

Aflll. . . 
aq C TTA AG G 



10 



15 



H20 



v3 



D3 Extender and Bridges 

! Bridges 

(BlpF3Brl) 5 1 -cgCttcacTcag tcT aga gaT aaC AGT aaA aaT acT TtG- 
taC Ttg caG Ctg a | GC age ctg-3 1 

(BlpF3Br2) 5 1 -cgCttcacTcag tcT aga gaT aaC AGT aaA aaT acT TtG- 
taC Ttg caG Ctg a | gc tct gtg-3 1 

! I lower strand is cut here 



! Extender 
(BlpF3Ext) 



5 ' - 



TcAgcTgcAAgTAcAAAgTATTTTTAcTgTTATcTcTAgAcTgAgTgAAgcg - 3 1 
BlpF3Ext is the reverse complement of: 

5* -cgCttcacTcag tcT aga gaT aaC AGT aaA aaT acT TtG taC Ttg caG Ctg a-3 ! 

(BlpF3PCR) 5 '-cgCttcacTcag tcT aga gaT aaC-3' 
E: HpyCHAIII Distinct GLG sequences surrounding site, bases 77-98 



%25 


i 


102#1,118#4,146#7,16 


9#9 f le#fl0,311##17,353#*30, 4 04#*37, 4 301 


ccgtgtattactgtgcgagaga 


2 


103#*2,307*#15,321##21, 


3303*#24,333#26,34 8*#28,364**31, 366 


132 


ctgtgtattactgtgcgagaga 




3 




108##3 


ccgtgtattactgtgcgagagg 




4 




124#*5,lf# 


#11 


ccgtgtattactgtgcaacaga 




5 




145#6 


ccatgtattactgtgcaagata 




6 




15* 


3 ##8 


ccgtgtattactgtgcggcaga 


,30 


7 




205* 


*12 


ccacatattactgtgcacacag 




8 




226# 


#13 


ccacatattactgtgcacggat 




9 




270# 


#14 


ccacgtattactgtgcacggat 




10 




309##16,343# 


#27 


ccttgtattactgtgcaaaaga 


35 


11 




313##18,374##35, 61* 


#50 


ctgtgtattactgtgcaagaga 


12 




315* 


#19 


ccgtgtattactgtaccacaga 




13 




320* 


#20 


ccttgtatcactgtgcgagaga 




14 




323* 


*22 


ccgtatattactgtgcgaaaga 




15 




330#23,3305* 


#25 


ctgtgtattactgtgcgaaaga 


40 


16 




349* 


#29 


ccgtgtattactgtactagaga 


17 




372##33 


ccgtgtattactgtgctagaga 




18 




373* 


#34 


ccgtgtattactgtactagaca 




19 




3d* 


#36 


ctgtgtattactgtaagaaaga 




20 




428* 


#38 


ccgtgtattactgtgcgagaaa 


45 


21 




4302##40, 4304# 


#41 


ccgtgtattactgtgccagaga 


22 




439* 


#44 


ctgtgtattactgtgcgagaca 




23 




551* 


#48 


ccatgtattactgtgcgagaca 



^ 4 5a#49 ccatgtattactgtgcgaga 

F: HpyCH4III REdaptors, Extenders, and Bridges 
F.l REdaptors 

! ONs for cleavage of HC (lower) in FR3 (bases 77-97) 
! For cleavage with HpyCH4III, Bst4CI, or Taal 
! cleavage is in lower chain before base 88. 



1 

(H43, 


.77, 


.97. 


,1-02#1) 


5' 


77 788 888 888 

78 901 234 567 
-cc gtg tat tAC 


889 
890 
TGT 


999 
123 
gcg 


999 
456 
aga 


9 
7 

g- 


-3' 


rn W 
A m 

64 


rn K 

*■ m 

62. 6 


(H43. 


.77, 


.97. 


,l-03#2) 


5' 


-c$| gtg tat tAC 


TGT 


gcg 


aga 


g- 


■3' 


62 


60.6 


(H43, 


.77, 


.97. 


,108#3) 


5' 


-cc gtg tat tAC 


TGT 


gcg 


aga 


g- 


■3' 


64 


62.6 


(H43. 


,77, 


.97. 


323#22) 


5' 


-cc gtjj| tat tac 


tgt 


gcg 


a i a 


g- 


■3 f 


60 


58.7 


(H43. 


,77. 


,97. 


330#23) 


5' 


-c|| gtg tat tac 


tgt 


gcg 


aj§a 


g- 


3' 


60 


58.7 


(H43. 


,77. 


,97. 


439#44) 


5' 


-c|| gtg tat tac 


tgt 


gcg 


aga 


I- 


3' 


62 


60.6 


(H43. 


.77. 


.97. 


551#48) 


5' 


-cc §jtg tat tac 


tgt 


gcg 


aga 


1" 


3' 


62 


60.6 


(H43. 


77. 


97. 


5a#49) 


5 1 


-cc itg tat tAC 


TGT 


gcg 


aga 




3* 


58 


58.3 



F . 2 Extender and Bridges 

! Xbal and Aflll sites in bridges are bunged 

(H43.XABrl) 5 ' -ggtgtagtga- 

| TCT | AGt | gac | aac | tct | aag | aat | act | etc | tac | ttg | cag | atg | - 
|aac|a qC|TTt |AGg|gct I gag I gac 1 aCT I GCAI Gtcl tac I tat tgt gcg aga-3 f 

(H43 . XABr2 ) 5 ' -ggtgtagtga- 

I TCT | AGt | gac | aac | tct | aag | aat | act | etc | tac | ttg | cag | atg | - 

I aac I a qC I TTt I AGcr I act I aaa I aac I aCT I GCA I Gt c I tac 1 tat tgt gcg aaa-3 ! 

(H43.XAExt) 5 1 -ATAgTAgAcT gcAgTgTccT cAgcccTTAA gcTgTTcATc TgcAAgTAgA- 
gAgTATTcTT AgAgTTgTcT cTAgATcAcT AcAcc-3 1 

!H43.XAExt is the reverse complement of 

! 5 1 -ggtgtagtga- 

! | TCT | AGA | gac | aac | tct | aag | aat | act | etc | tac | ttg | cag | atg | - 
! I aac I a qC I TTA I AGq I get I gag I gac I aCT I GCA I Gt c I tac I tat -3 1 

(H43.XAPCR) 5'-ggtgtagtga | TCT | AGA | gac | aac- 3' 
! Xbal and Aflll sites in bridges are bunged 
(H43 . ABrl ) 5 1 -ggtgtagtga- 

I aac I a qC I TTt I AGq I get I gag I gac I aCT I GCA I Gtcl tac I tat tgt gcg aga-3 1 
(H43 . ABr2 ) 5 1 -ggtgtagtga- 

I aac | a gC I TTt I AGg I get I gag I gac I aCT I GCAI Gtc I tac I tat tgt gcg aaa-3' 
(H43 . AExt ) 5 1 - ATAgTAgAcT gcAgTgTccTcAgcccTTAAgcTgTTTcAcTAcAcc- 3 1 




!(H43.AExt) is the reverse complement of 5 ' -ggtgtagtga- 
! I aac I a qC I TTA I AGg I get I gag I aac I aCT I GCAI Gtc i tac I tat -3* 
(H43.APCR) 5 f -ggtgtagtga | aac I agC I TTA I AGg 1 get I g-3 ' 
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Sites to be varied > *** *** *** 

FR1 > | . . . CDR1 | FR2 

46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 
ASGFTFSSYAMSWVR 
lactlTCCIGGAIttclactlttcl tct | tCG | TAG | Get | atg | tc 1 1 tgg I gtt I cgC I 143 
I cga | agg| cct | aag | tga | aagl aga | age | atg | cga | tac | aga | acc | caa | gcgl 

I BspEI | t BsiWIi IBstXI. 

Sites to be varies > *** *** *** 

FR2 >| . . . CDR2 

61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 
QAPGKGLEWVSAI SG 
|CAa|gct|ccTIGG t I aaa I grt I ttg I gag I torr I gtt I tct I get I ate I tct I otrt I 188 
I gtt | cga | gga | cca | ttt | cca | aac | etc | acc | caa | aga I cga I tag I aga | cca | 
...BstXI | 

**+ *** 

CDR2 I FR3 

76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 
SGGSTYYADSVKGRF 
I tct j ggt | ggc I agrt I act I tac I t at I get I qac I tec I gtt 1 aaa I go t I cqc I ttc I 233 
I aga | cca | ccg I tea | tga | atg I ata | cga | ctg | agg | caa | ttt | cca | gcg | aag | 

FR3 

91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 
TISRDNSKNT LYLQM 
I act | ate | TCT | AGA | gac | aac | tct | aag | aat I act | etc I tac I ttg I cag | atg | 278 
I tga | tag | aga I tc t | ctg | ttg | aga | ttc | tta | tga | gag | atg | aac | gtc | tac | 
I Xbal | 

FR3 >| 

106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 
NSLRAEDTAVYYCAK 
I aac | a gC|TTA|AGg|gctlgag|gaclaCTI GCA | Gtc | tac | tat | tgc | get | aaa | 323 
I ttg | teg | aat | tec | cga | etc | ctg | tga | cgt | cag | a tg | at a | acg | cga | ttt | 
lAflll I | PstI | 

CDR3 | FR4 

121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 
DYEGTGYAFDIWGQG 
I ga.c I tat | gaa | ggt | act I ggt I tat I get I ttc I gaC I ATA I TGg I ggt I c aa I aat I 368 
I ctg | ata | ctt | cca | tga | cca | ata I cga | aag | ctg | tat | acc | cca | gtt | cca | 

I Ndel | 

FR4 >| 

136 137 138 139 140 141 142 
T M V T V S S 
I act | atG | GTC | ACC | gtc I tct | agt- 389 
I tga | tac | cag | tgg | cag | aga | tca- 
I BstEII | 

143 144 145 146 147 148 149 150 151 152 

ASTKGPSVFP 
gee tec acc aaG GGC CCa teg GTC TTC ccc-3' 419 
! egg agg t gg ttc ccg ggt age cag aag ggg -5 ' 

! Bspl20I. BbsI...(2/2) 

! Apal .... 

(SFPRMET) 5 ! -ctg tct gaa cG GCC cag ccG-3' 
(T0PFR1A) 5' -ctg tct gaa cG GCC cag ccG GCC atg gec- 
gaa | gtt | CAA I TTG | tta | gag i tct | ggt I - 

I ggc | ggt | ctt I gtt | cag | cct 1 ggt | ggt | tct | tta-3 ' 
(BOTFR1B) 3'-caa|gtc|gga|cca|cca|aga|aat|gca|gaa|aga|acg|cga|- 

I cga | agg | cct | aag | tga | aag- 5' ! bottom strand 



10 



15 



20 



25 



(B0TFR2) 3' -acc|caa|gcg|- 

Igtt legal gga |cca| ttt | cca| aac |ctc | acc| caa|aga| -5 * ! bottom strand 
(B0TFR3) 3 1 - a|cga|ctg|agg|caa|ttt|cca|gcg|aag|- 

|tga| tag|aga| tct|ctg| ttg|aga| ttc|tta| tga | gag | atg | aac | gtc | tac|- 
| ttg I teg I aat | tec | cga | etc | ctg | tga-5 1 
(F06) 5 ' -gC I TTAI AGg | get | gag I gac I aCT I GCA| Gtc I tac I tat I tgc | get | aaa I - 

I gac I tat I gaa | ggt | act | ggt I tat I get | ttc | gaC I ATA| TGg I ggt I c-3* 
(BOTFR4 ) 3 • -cga | aag| ctgl tat | acc | cca | gtt | cca | - 
I tga I tac | cag | tgg | cag| aga | tca- 

egg agg tgg ttc ccg ggt age cag aag ggg-5' ! bottom strand 
(BOTPRCPRIM) 3*-gg ttc ccg ggt age cag aag ggg-5 1 

CDR1 diversity 

ON- vgC 1 ) 5 * - IqctlTCCIGGAIttclactlttcl tct I<1> I TAC 1<1> I atg I<1> I - 

CDR1 ^859 

I tgg; I gtt I cqC I CAa I get I ccT I GG -3 ' 

<1> stands for an equimolar mix of {ADEFGHIKLMNPQRSTVWY} ; no C 

(this is not a sequence) 

CDR2 diversity 

(0N-vgC2 ) 5 * -ggt | ttg I gag | tgg j gtt | tct | <2> | ate I <2> | <3> | - 

CDR2 

I tct I ggt I ggc | <1> | act I <1> I tat | get | gac | tec I gtt | aaa I gg-3 1 

CDR2 

<1> is an equimolar mixture of {ADEFGHIKLMNPQRSTVWY}; no C 
<2> is an equimolar mixture of { YRWVGS } ; no ACDEFHI KLMNPQT 
<3> is an equimolar mixture of {PS}; no ACDEFGHIKLMNQRTVWY 



Table 800 (new) 



The following list of enzymes was taken from 
http : //rebase . neb . com/cai-bin/asymmlist . 

I have removed the enzymes that a) cut within the recognition, b) 
cut on both sides of the recognition, or c) have fewer than 2 
bases between recognition and closest cut site. 



REBASE Enzymes 
04/13/2001 



Type II restriction 
sequences : 



enzymes with asymmetric recognition 



m 

h is? 

SSS 

WW: 
'•z $ 

"•4 

Q 



Isoschizomers 



NN" 



Enzymes Recognition Sequence 

Aarl CACCTGCNNNN A NNNN_ 

Acelll CAGCTCNNNNNNN A NNNN_ 

Bbr7I GAAGACNNNNNNN A NNNN_ 

Bbvl GCAGCNNNNNNNN A NNNN_ 

Bb v 1 1 GAAGACNN A NNNN_ 

Bce83I CTTGAGNNNNNNNNNNNNNN_ 

BceAI ACGGCNNNNNNNNNNNN A NN_ 

Beef I ACGGCNNNNNNNNNNNN A N_ 

BciVI GTATCCNNNNN_N A 

Bfil ACTGGGNNNN_N A 

Bin I GGAT CNNNN A N_ 

BscAI GCAT CNNNN A NN_ 

BseRI GAGGAGNNNNNNNN_NN A 

BsmFI GGGACNNNNNNNNNN A NNNN_ 

BspMI ACCTGCNNNN A NNNN_ 

Ecil GGCGGANNNNNNNNN_NN A 

Eco57I CTGAAGNNNNNNNNNNNNNN 

Faul CCCGCNNNN A NN_ 

Fokl GGAT GNNNNNNNNN A NNNN_ 

Gsul CTGGAGNNNNNNNNNNNNNN 

Hgal GACGCNNNNN A NNNNN_ 

HphI GG T GANNNNNNN_N A 

MboII GAAG ANNNNNNN_N A 

Mlyl GAGTCNNNNN A SchI 

Mmel TCCRACNNNNNNNNNNNNNNNNNN_NN A 

Mnll CCTCNNNNNN_N A 

Plel GAG T CNNNN A N_ PpsI 

RleAI C C C AC ANNNNNNNNN_NNN A 

SfaNI GCATCNNNNN A NNNN_ BspST5I 

SspD5I GGTGANNNNNNNN A 

Sthl32I CCCGNNNN A NNNN_ 

StsI GGAT GNNNNNNNNNN A NNNN_ - 

Taqll GACCGANNNNNNNNN_NN A , CACCCANNNNNNNNN 

Tthlllll CAARCANNNNNNNNN_NN A 

UbaPI CGAACG 



Bful 
Bmrl 



BspLUllIII 
Acc36I 

NN A BspKT5I 
BstFZ438I 
BstPZ418I 

NN A 

AsuHPI 



Suppliers 

y 



y 
y 



y 
y 
y 
y 
y 
y 
y 
y 
y 
y 
y 
y 

y 
y 



NN' 



The notation is A means cut the upper strand and _ means cut the 
lower strand. If the upper and lower strand are cut at the same 
place, then only " appears. 



Table 120: MAX I A3, annotated 
! MALIA3 9532 bases 
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aat 


get 


act 


act 


af f 


d y L. 


aga 


att 


gat 


gec 


3 r* c 

U l_ 


ttt 


tea 




cgc 


gec 
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! gene ii continued 






























49 


cca 


aat 


gaa 


aat 


a ta 


get 


aaa 


cag 


att 


att 


gac 


cat 


tta 


cga 


aat 


gta 






97 


tct 


aat 


ggt 


caa 


act 


aaa 


tct 


act 


cgt 


teg 


cag 


aat 


taa 


gaa 


tea 


act 






145 


gtt 


aca 


tgg 


aat 


gaa 


act 


tec 


aga 


cac 


cgt 


act 


tta 


att 


gca 


tat 


tta 






193 


aaa 


cat 


gtt 


gag 


eta 


cacr 


cae 


caa 

<~uy 


att 


cag 


caa 


tta 


aac 


tct 


aag 


cca 




10 


241 


tec 


gca 


aaa 


atg 


acc 


tct 


tat 


caa 


aag 


gag 


caa 


tta 


aag 


gta 


etc 


tct 






289 


aat 


cct 


gac 


ctg 


tta 


gag 


ttt 


get 


tec 


ggt 


ctg 


gtt 


cgc 


ttt 


gaa 


get 






337 


cga 


att 


aaa 


acg 


caa 


tat 


ttg 


aag 


tct 


ttc 


ggg 


ctt 


cct 


ctt 


aat 


ctt 






385 


ttt 


gat 


gca 


ate 


cere 


ttt 


get 


tct 


gac 


tat 


aat 


agt 


J cag 


aat 


aaa 


ga c 






433 


ctg 


att 


ttt 


gat 


tta 


tgg 


tea 


ttc 


teg 


ttt 


tct 


gaa 


eta 


ttt 


aaa 


gca 




15 


481 


ttt 


gag 


ggg 


gat 


tea 


ATG 


aat 


att 


tat 


gac 


gat 


tec 


gca 


gta 


ttg 


gac 






1 




RBS? 






Start gene x, ii continues 










Ij 
%3 






get 


ate 


cag 


tct 


aaa 


cat 


ttt 


act 


att 


acc 


ccc 


tct 


rr n 

ggc 


-a -a a 
a. ex. a 










577 


ttt 


gca 


aaa 


gee 


tct 


cgc 


tat 


ttt 


ggt 


ttt 


tat 


cgt 




ctg 


y u ct 


aac 


iJ . 

u a 




625 


gag 


ggt 


tat 


gat 


agt 


gtt 


get 


ctt 


act 


atg 


cct 


cgt 


aat 


tec 


ttt 


tgg 


*^ 
^ J 


20 


673 


cgt 


tat 


gta 


tct 


gca 


tta 


gtt 


gaa 


tgt 


ggt 


att 


cct 


aaa 


tct 


caa 


ctg 


■a -er 

C.3 




721 


atg 


aat 


ctt 


tct 


ace 


tgt 


aat 


aat 


gtt 


gtt 


ccg 


tta 


gtt 


cgt 


ttt 


att 


Is : ^ 




769 


aac 


gta 


gat 


ttt 


tct 


tec 


caa 


cgt 


cct 


gac 


tgg 


tat 


aat 


era ct 
y ay. 


cca 


att 


S 




817 


ctt 


aaa 


ate 


gca 


























C3 




i 












X & 


II 






















25 


832 

1 


ggtaattca ca 


























C3 




i 


Ml 








E5 










Q10 










T15 








843 


ATG 


att 


aaa 


gtt 


gaa 


att 


aaa 


cca 


tct 


caa 


gee 


caa 


ttt 


act 


act 


cgt 








Start gene V 




























30 










































S17 






S20 










P25 










E30 










891 


tct 


ggt 


gtt 


tct 


cgt 


cag 


ggc 


aag 


cct 


tat 


tea 


ctg 


aat 


gag 


cag 


ctt 



V35 E40 V45 

939 tgt tac gtt gat ttg ggt aat gaa tat ccg gtt ctt gtc aag att act 

D50 A55 L60 

987 ctt gat gaa ggt cag cca gee tat gcg cct ggt cTG TAC Acc gtt cat 

BsrGI . . . 



L65 V70 S75 R80 

1035 ctg tec tct ttc aaa gtt ggt cag ttc ggt tec ctt atg att gac cgt 



P85 



K87 end of V 



1083 ctg cgc etc gtt ccg get aag TAA C 



1108 ATG gag cag gtc gcg gat ttc gac aca att tat cag gcg atg 
Start gene VII 



1150 ata caa ate tec gtt gta ctt tgt ttc gcg ctt ggt ata ate 



VII and IX overlap. 



S2 V3 L4 V5 



S10 



1192 get ggg ggt caa agA TGA gt gtt tta gtg tat tct ttc gee tct ttc gtt 

End VII 
I start IX 



1242 tta ggt tgg tgc ctt cgt agt ggc att acg tat ttt ace cgt tta atg gaa 

1293 act tec tc 

.... stop of IX, IX and VIII overlap by four bases 
1301 ATG aaa aag tct tta gtc etc aaa gee tct gta gee gtt get acc etc 
Start signal sequence of viii. 

1349 gtt ccg atg ctg tct ttc get get gag ggt gac gat ccc gca aaa gcg 

mature VIII > 

1397 gee ttt aac tec ctg caa gee tea gcg acc gaa tat ate ggt tat gcg 
1445 tgg gcg atg gtt gtt gtc att 

1466 gtc ggc gca act ate ggt ate aag ctg ttt aag 
1499 aaa ttc acc teg aaa gca ! 1515 



L13 



WIS 



G20 



T25 



E29 



-35 



1517 



age tga taaaccgat acaattaaag gctccttttg 



-10 



1552 gagecttttt ttttGGAGAt ttt ! S.D. underlined 



< III signal sequence 



> 



MKKLLFAIPLV 
1575 caac GTG aaa aaa tta tta ttc gca att cct tta gtt ! 1611 





V 


P 


F 


Y 


S 


H 


S 


A 


Q 


















1612 


gtt 


cct 


ttc 


tat 


tct 


cac 


aGT 


gcA Cag 


tCT 






























ApaLI . , 




















1642 




GTC 


GTG 


ACG 


CAG 


CCG 


CCC 


TCA 


GTG 


TCT 


GGG 


GCC 


CCA" 


GGG 


CAG 










AGG 


GTC 


ACC 


ATC 


TCC 


TGC 


ACT 


GGG 


AGC 


AGC 


TCC 


AAC 


ATC 


GGG 


GCA 








BstEII. . . 




























1729 




GGT 


TAT' GAT 


GTA 


CAC 


TGG 


TAC 


CAG 


CAG 


CTT 


CCA 


GGA 


ACA 


GCC 


CCC 


AAA 


1777 




CTC 


CTC 


ATC 


TAT 


GGT 


AAC 


AGC 


AAT 


CGG 


CCC 


TCA 


GGG 


GTC 


CCT 


GAC 


CGA 


1825 




TTC 


TCT 


GGC 


TCC 


AAG 


TCT 


GGC 


ACC 


TCA 


GCC 


TCC 


CTG 


GCC 


ATC 


ACT 




1870 




GGG 


CTC 


CAG 


GCT 


GAG 


GAT 


GAG 


GCT 


GAT 


TAT 














1900 




TAC 


TGC 


CAG 


TCC 


TAT 


GAC 


AGC 


AGC 


CTG 


AGT 














1930 




GGC 


CTT 


TAT 


GTC 


TTC 


GGA 


ACT 


GGG 


ACC 


AAG 


GTC 


ACC 


GTC 






























BstEII. . . 










1969 




CTA 


GGT 


CAG 


CCC 


AAG 


GCC 


AAC 


CCC 


ACT 


GTC 


ACT 












2002 




CTG 


TTC 


CCG 


CCC 


TCC 


TCT 


GAG 


GAG 


CTC 


CAA 


GCC 


AAC 


AAG 


GCC 


ACA 


CTA 


2050 




GTG 


TGT 


CTG 


ATC 


AGT 


GAC 


TTC 


TAC 


CCG 


GGA 


GCT 


GTG 


ACA 


GTG 


GCC 


TGG 


2098 




AAG 


GCA 


GAT 


AGC 


AGC 


CCC 


GTC 


AAG 


GCG 


GGA 


GTG 


GAG 


ACC 


ACC 


ACA 


CCC 


2146 




TCC 


AAA 


CAA 


AGC 


AAC 


AAC 


AAG 


TAC 


GCG 


GCC 


AGC 


AGC 


TAT 


CTG 


AGC 


CTG 


2194 




ACG 


CCT 


GAG 


CAG 


TGG 


AAG 


TCC 


CAC 


AGA 


AGC 


TAC 


AGC 


TGC 


CAG 


GTC 


ACG 


2242 




CAT 


GAA 


GGG 


AGC 


ACC 


GTG 


GAG 


AAG 


ACA 


GTG 


GCC 


CCT 


ACA 


GAA 


TGT 


TCA 



2290 TAA TAA ACCG CCTCCACCG G GCGCGCCA AT TCTATTTCAA GGAGACAGTC ATA 

AscI 



PelB signal > 

MKYLLPTAAAGLLLL 
2343 ATG AAA TAC CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC 



16 17 18 19 20 21 22 

A A Q P A MA 

2388 gcG GCC cag ccG G CC atq q cc 

Sfil 

NgoMI. . . (1/2) 

Ncol 



2409 



FR1 (DP47/V3-23) 

23 24 25 26 27 28 29 30 
EVQLLESG 
gaa | gtt | CAA| TTG | tta | gag | tct | ggt | 
t Mfel | 



10 



2433 



FR1 -- 

31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 
GGLVQPGGSLRLSCA 
I ggc | ggt | ctt | gtt | cag | cct | ggt | ggt | tct | tta I cgt | ctt | tct | tgc | get | 



%3 

- JK, 

3 : 3 
•5 -a? 

y 

w 

p 

in 



If sj 



is, 



15 



20 



25 



2478 



2523 



FR1 >| . . .CDR1 | FR2 

46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 
ASGFTFSSYAMSWVR 
I get | TCC | GGA| ttc | act | ttc | tct | tCG | TAG | Get | atg | tct | tgg | gtt | cgC | 

I BspEI | | BsiWI| IBstXI. 



FR2 

61 62 63 64 
Q A P G 



>| . . . CDR2 

65 66 67 68 69 70 71 72 73 74 75 
KGLEWVSAISG 



I CAa | get | ccT | GGt | aaa | ggt | ttg | gag | tgg | gtt | tct | get | ate | tct | ggt | 
BstXI | 



2568 



CDR2 | FR3 

76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 
SG GS TYYADSVKGRF 
I tct | ggt | ggc | agt | act | tac | tat | get | gac | tec | gtt | aaa | ggt | cgc | ttc | 
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i I V K C N .End of I 
iv L3 L N5 V 17 N F V10 
5775 att gtt aaa tgt aat TAA T TTT GTT 
IV continued 

5800 ttc ttg atg ttt gtt tea tea tct tct ttt get cag gta att gaa atg 
5848 aat aat teg cct ctg cgc gat ttt gta act tgg tat tea aag caa tea 
5896 ggc gaa tec gtt att gtt tct ccc gat gta aaa ggt act gtt act gta 
5944 tat tea tct gac gtt aaa cct gaa aat eta cgc aat ttc ttt att tct 
5992 gtt tta cgt get aat aat ttt gat atg gtt ggt tea att cct tec ata 
6040 att cag aag tat aat cca aac aat cag gat tat att gat gaa ttg cca 
6088 tea tct gat aat cag gaa tat gat gat aat tec get cct tct ggt ggt 
6136 ttc ttt gtt ccg caa aat gat aat gtt act caa act ttt aaa att aat 
6184 aac gtt egg gca aag gat tta ata cga gtt gtc gaa ttg ttt gta aag 



6232 tct aat act tct aaa tec tea aat gta tta tct att gac ggc tct aat 
6280 eta tta gtt gtt TCT gca cct aaa gat att tta gat aac ctt cct caa 

ApaLI removed 

6328 ttc ctt tct act gtt gat ttg cca act gac cag ata ttg att gag ggt 
6376 ttg ata ttt gag gtt cag caa ggt gat get tta gat ttt tea ttt get 
6424 get ggc tct cag cgt ggc act gtt gca ggc ggt gtt aat act gac cgc 
6472 etc ace tct gtt tta tct tct get ggt ggt teg ttc ggt att ttt aat 
6520 ggc gat gtt tta ggg eta tea gtt cgc gca tta aag act aat age cat 
6568 tea aaa ata ttg tct gtg cca cgt att ctt acg ctt tea ggt cag aag 
6616 ggt tct ate tct gtT GGC CAg aat gtc cct ttt att act ggt cgt gtg 
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6664 act ggt gaa tct gec aat gta aat aat cca ttt cag acg att gag cgt 
6712 caa aat gta ggt att tec atg age gtt ttt cct gtt gca atg get ggc 
6760 ggt aat att gtt ctg gat att acc age aag gee gat agt ttg agt tct 
6808 tct act cag gca agt gat gtt att act aat caa aga agt att get aca 
6856 acg gtt aat ttg cgt gat gga cag act ctt tta etc ggt ggc etc act 
6904 gat tat aaa aac act tct caa gat tct ggc gta ccg ttc ctg tct aaa 
6952 ate cct tta ate ggc etc ctg ttt age tec cgc tct gat tec aac gag 
7000 gaa age acg tta tac gtg etc gtc aaa gca acc ata gta cgc gee ctg 
7048 TAG eggegcatt 
End IV 

7060 aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc 
7120 gcccgctcct ttegctttet tcccttcctt tctcgccacg ttcGCCGGCt ttccccgtca 

NgoMI_ 

7180 agctctaaat cgggggctcc ctttagggtt ccgatttagt getttaegge acctcgaccc 
7240 caaaaaactt gatttgggtg atggttCACG TAGTGggcca tcgccctgat agacggtttt 

Drain 

7300 tcgccctttG ACGTTGGAGT Ccacgttctt taatagtgga ctcttgttcc aaactggaac 
DrdI 

7360 aacactcaac cctatctcgg gctattcttt tgatttataa gggattttgc egatttegga 
7420 accaccatca aacaggattt tcgcctgctg gggcaaacca gcgtggaccg ettgetgeaa 
7480 ctctctcagg gecaggeggt gaagggcaat CAGCTGttgc cCGTCTCact ggtgaaaaga 

PvuII. BsmBI. 

7540 aaaaccaccc tGGATCC AAGCTT 

BamHI Hindlll (1/2) 

Insert carrying bla gene 
7563 gcaggtg gcacttttcg gggaaatgtg cgcggaaccc 

7600 ctatttgttt atttttctaa atacattcaa atatGTATCC gctcatgaga caataaccct 

BciVI 



7660 gataaatgct tcaataatat tgaaaaAGGA AGAgt 

RBS . ? . . . 



Start bla gene 



7695 


ATG 


agt 


att 


caa 


cat 


ttc cgt 


gtc 


gee 


ctt 


att 


ccc 


ttt 


ttt 


gcg 


gca 


ttt 


7746 


tgc 


ctt 


cct 


gtt 


ttt 


get cac 


cca 


gaa 


a eg 


ctg 


gtg 


aaa 


gta 


aaa 


gat 


get 


7797 


gaa 


gat 


cag 


ttg 


ggC 


gCA CGA Gtg 


ggt 


tac 


ate 


gaa 


ctg 


gat 


etc 


aac 


age 














BssSI . 
































ApaLI removed 




















7848 


ggt 


aag 


ate 


ctt 


gag 


agt ttt 


cgc 


ccc 


gaa 


gaa 


cgt 


ttt 


cca 


atg 


atg 


age 


7899 


act 


ttt 


aaa 


gtt 


ctg 


eta tcrt 


cat 


aca 


eta 


tta 


tec 


cgt 


att 


gac 


gee 


ggg 


7950 


caa 


gaG 


CAA 


CTC 


GGT 


CGc egg 


gcg 


egg 


tat 


tct 


cag 


aat 


gac 


ttg 


gtt 


gAG 






Bcgl 


























SCc 


8001 


TAC 


Tea 


cca 


gtc 


aca 


gaa aag 


cat 


ctt 


acg 


gat 


ggc 


atg 


aca 


gta 


aga 


gaa 




Seal 


































tta 


tgc 


agt 


get 


gec 


ata acc 


atg 


agt 


gat 


aac 


act 


gcg 


gee 


aac 


tta 


ctt 


8103 


ctg 


aca 


aCG 


ATC 


Gga 


gga ccg 


aag 


gag 


eta 


acc 


get 


ttt 


ttg 


cac 


aac 


atg 








Pvul 


























8154 


ggg 


gat 


cat 


gta 


act 


cgc ctt 


gat 


cgt 


tgg 


gaa 


ccg 


gag 


ctg 


aat 


gaa 


gec 


8205 


ata 


cca 


aac 


gac 


gag 


cgt gac 


acc 


a eg 


atg 


cct 


gta 


gca 


atg 


cca 


aca 


acg 


8256 


tTG 


CGC 


Aaa 


eta 


tta 


act ggc 


gaa 


eta 


ctt 


act 


eta 


get 


tec 


egg 


caa 


caa 




Fspl . . . 






























8307 


tta 


ata 


gac 


tgg 


atg 


gag gcg 


gat 


aaa 


gtt 


gca 


gga 


cca 


ctt 


ctg 


cgc 


teg 


8358 


GCC 


ctt 


ccG 


GCt 


ggc 


tgg ttt 


att 


get 


gat 


aaa 


tct 


gga 


gee 


ggt 


gag 


cgt 




Bgll 






























8409 


gGG 


TCT 


Cgc 


ggt 


ate 


att gca 


gca 


ctg 


ggg 


cca 


gat 


ggt 


aag 


ccc 


tec 


cgt 




Bsal 






























8460 


ate 


gta 


gtt 


ate 


tac 


acG ACg 


ggg 


aGT 


Cag 


gca 


act 


atg 


gat 


gaa 


cga 


aat 














Ahdl 






















8511 


aga 


cag 


ate 


get 


gag 


ata ggt 


gee 


tea 


ctg 


att 


aag 


cat 


tgg 


TAA 


ctgt 



stop 

8560 cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 
8620 ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 
8680 cgttccactg taegtaagae cccc 

8704 AAGCTT GTC GAC tgaa tggcgaatgg cgctttgcct 
Hindlll Sail.. 
(2/2) Hindi 

8740 ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgegatett 



8790 


CCTGAGG 
Bsu36I_ 












8797 


ccgat 


actgtcgtcg 


tcccctcaaa 


ctggcagatg 






8832 


cacggttacg 


atgcgcccat 


ctacaccaac 


gtaacctatc 


ccattacggt 


caatccgccg 


8892 


tttgttccca 


cggagaatcc 


qacqggttqt 


tactcgctca 


catttaatgt 


tgatgaaagc 


8952 


tggctacagg 


aaggccagac 


gcgaattatt 


tttgatggcg 


ttcctattgg 


ttaaaaaatg 


9012 


agctgattta 


acaaaaattt 


aacgcgaatt 


ttaacaaaat 


attaacgttt 


acaATTTAAA 
Swal . . . 


9072 


Tatttgctta 


tacaatcttc 


ctgtttttgg 


ggcttttctg 


attatcaacc 


GGGGTAcat 



RBS? 



9131 


ATG 


att 


gac 


atg 


eta 


gtt 


tta 


cga 


tta 


ccg 


ttc 


ate 


gat 


tct 


ctt 


gtt 


tgc 




Start gene II 




























9182 


tec 


aga 


etc 


tea 


ggc 


aat 


gac 


ctg 


ata 


gee 


ttt 


gtA 


GAT 


CTc 


tea 


aaa 


ata 


























Bglll. 










9233 


get 


acc 


etc 


tec 


ggc 


atg 


aat 


tta 


tea 


get 


aga 


acg 


gtt 


gaa 


tat 


cat 


att 


9284 


gat 


ggt 


gat 


ttg 


act 


gtc 


tec 


ggc 


ctt 


tct 


cac 


cct 


ttt 


gaa 


tct 


tta 


cct 


9335 


aca 


cat 


tac 


tea 


ggc 


att 


gca 


ttt 


aaa 


ata 


tat 


gag 


ggt 


tct 


aaa 


aat 


ttt 


9386 


tat 


cct 


tgc 


gtt 


gaa 


ata 


aag 


get 


tct 


ccc 


gca 


aaa 


gta 


tta 


cag 


ggt 


cat 


9437 


aat 


gtt 


ttt 


ggt 


aca 


acc 


gat 


tta 


get 


tta 


tgc 


tct 


gag 


get 


tta 


ttg 


ctt 


9488 


aat 


ttt 


get 


aat 


tct 


ttg 


cct 


tgc 


ctg 


tat 


gat 


tta 


ttg 


gat 


gtt 


! 9532 



gene II continues 




Table 12 OB: Sequence of MALIA3, condensed 

LOCUS MALI A3 9532 CIRCULAR 

ORIGIN 







l 


AATGCTACTA 


CTATTAGTAG 


AATTGATGCC 


ACCTTTTCAG 


CTCGCGCCCC 


AAATGAAAAT 




5 


61 


ATAGCTAAAC 


AGGTTATTGA 


CCATTTGCGA 


AATGTATCTA 


ATGGTCAAAC 


TAAATCTACT 






121 


CGTTCGCAGA 


ATTGGGAATC 


AACTGTTACA 


TGGAATGAAA 


CTTCCAGACA 


CCGTACTTTA 






181 


GTTGCATATT 


TAAAACATGT 


TGAGCTACAG 


CACCAGATTC 


AGCAATTAAG 


CTCTAAGCCA 






241 


TCCGCAAAAA 


TGACCTCTTA 


TCAAAAGGAG 


CAATTAAAGG 


TACTCTCTAA 


TCCTGACCTG 






301 


TTGGAGTTTG 


CTTCCGGTCT 


GGTTCGCTTT 


GAAGCTCGAA 


TTAAAACGCG 


ATATTTGAAG 




10 


361 


TCTTTCGGGC 


TTCCTCTTAA 


TCTTTTTGAT 


GCAATCCGCT 


TTGCTTCTGA 


CTATAATAGT 






421 


CAGGGTAAAG 


ACCTGATTTT 


TGATTTATGG 


TCATTCTCGT 


TTTCTGAACT 


GTTTAAAGCA 






481 


TTTGAGGGGG 


ATTCAATGAA 


TATTTATGAC 


GATTCCGCAG 


TATTGGACGC 


TATCCAGTCT 






541 


AAACATTTTA 


CTATTACCCC 


CTCTGGCAAA 


ACTTCTTTTG 


CAAAAGCCTC 


TCGCTATTTT 






601 


GGTTTTTATC 


GTCGTCTGGT 


AAACGAGGGT 


TATGATAGTG 


TTGCTCTTAC 


TATGCCTCGT 




15 


661 


AATTCCTTTT 


GGCGTTATGT 


ATCTGCATTA 


GTTGAATGTG 


GTATTCCTAA 


ATCTCAACTG 


□ 




721 


ATGAATCTTT 


CTACCTGTAA 


TAATGTTGTT 


CCGTTAGTTC 


GTTTTATTAA 


CGTAGATTTT 




781 


TCTTCCCAAC 


GTCCTGACTG 


GTATAATGAG 


CCAGTTCTTA 


AAATCGCATA 


AGGTAATTCA 






841 


CAATGATTAA 


AGTTGAAATT 


AAACCATCTC 


AAGCCCAATT 


TACTACTCGT 


TCTGGTGTTT 


w 




901 


CTCGTCAGGG 


CAAGCCTTAT 


TCACTGAATG 


AGCAGCTTTG 


TTACGTTGAT 


TTGGGTAATG 


w 

S K, 


20 


961 


AATATCCGGT 


TCTTGTCAAG 


ATTACTCTTG 


ATGAAGGTCA 


GCCAGCCTAT 


GCGCCTGGTC 




1021 


TGTACACCGT 


TCATCTGTCC 


TCTTTCAAAG 


TTGGTCAGTT 


CGGTTCCCTT 


ATGATTGACC 


in 




1081 


GTCTGCGCCT 


CGTTCCGGCT 


AAGTAACATG 


GAGCAGGTCG 


CGGATTTCGA 


CACAATTTAT 






1141 


CAGGCGATGA 


TACAAATCTC 


CGTTGTACTT 


TGTTTCGCGC 


TTGGTATAAT 


CGCTGGGGGT 


(3 

3 S3 




1201 


CAAAGATGAG 


TGTTTTAGTG 


TATTCTTTCG 


CCTCTTTCGT 


TTTAGGTTGG 


TGCCTTCGTA 




25 


1261 


GTGGCATTAC 


GTATTTTACC 


CGTTTAATGG 


AAACTTCCTC 


ATGAAAAAGT 


CTTTAGTCCT 


VI 

■a a? 




1321 


CAAAGCCTCT 


GTAGCCGTTG 


CTACCCTCGT 


TCCGATGCTG 


TCTTTCGCTG 


CTGAGGGTGA 




1381 


CGATCCCGCA 


AAAGCGGCCT 


TTAACTCCCT 


GCAAGCCTCA 


GCGACCGAAT 


ATATCGGTTA 






1441 


TGCGTGGGCG 


ATGGTTGTTG 


TCATTGTCGG 


CGCAACTATC 


GGTATCAAGC 


TGTTTAAGAA 






1501 


ATTCACCTCG 


AAAGCAAGCT 


GATAAACCGA 


TACAATTAAA 


GGCTCCTTTT 


GGAGCCTTTT 




30 


1561 


TTTTTGGAGA 


TTTTCAACGT 


GAAAAAATTA 


TTATTCGCAA 


TTCCTTTAGT 


TGTTCCTTTC 






1621 


TATTCTCACA 


GTGCACAGTC 


TGTCGTGACG 


CAGCCGCCCT 


CAGTGTCTGG 


GGCCCCAGGG 






lUOl 


PAftAftflfyrnA 




r a n t nf^n a ft n 


A (ZCT C C A A C A 




TTATGATGTA 






1741 


CACTGGTACC 


AGCAGCTTCC 


AGGAACAGCC 


CCCAAACTCC 


T CATC TAT GG 


TAACAGCAAT 






1801 


CGGCCCTCAG 


GGGTCCCTGA 


CCGATTCTCT 


GGCTCCAAGT 


CTGGCACCTC 


AGCCTCCCTG 




35 


1861 


GCCATCACTG 


GGCTCCAGGC 


TGAGGATGAG 


GCTGATTATT 


ACTGCCAGTC 


CTATGACAGC 






1921 


AGCCTGAGTG 


GCCTTTATGT 


CTTCGGAACT 


GGGACCAAGG 


TCACCGTCCT 


AGGTCAGCCC 






1981 


AAGGCCAACC 


CCACTGTCAC 


TCTGTTCCCG 


CCCTCCTCTG 


AGGAGCTCCA 


AGCCAACAAG 






2041 


GCCACACTAG 


TGTGTCTGAT 


CAGTGACTTC 


TACCCGGGAG 


CTGTGACAGT 


GGCCTGGAAG 






2101 


GCAGATAGCA 


GCCCCGTCAA 


GGCGGGAGTG 


GAGACCACCA 


CACCCTCCAA 


ACAAAGCAAC 








2161 


AACAAGTACG 


CGGCCAGCAG 


CTATCTGAGC 


CTGACGCCTG 


AGCAGTGGAA 


GTCCCACAGA 






2221 


AGCTACAGCT 


GCCAGGTCAC 


GCATGAAGGG 


AGCACCGTGG 


AGAAGACAGT 


GGCCCCTACA 






2281 


GAATGTTCAT 


AATAAACCGC 


CTCCACCGGG 


CGCGCCAATT 


CTATTTCAAG 


GAGACAGTCA 






2341 


TAATGAAATA 


CCTATTGCCT 


ACGGCAGCCG 


CTGGATTGTT 


ATTACTCGCG 


GCCCAGCCGG 




5 


2401 


CCATGGCCGA 


AGTTCAATTG 


TTAGAGTCTG 


GTGGCGGTCT 


TGTTCAGCCT 


GGTGGTTCTT 






2461 


TACGTCTTTC 


TTGCGCTGCT 


TCCGGATTCA 


CTTTCTCTTC 


GTACGCTATG 


TCTTGGGTTC 






2521 


GCCAAGCTCC 


TGGTAAAGGT 


TTGGAGTGGG 


TTTCTGCTAT 


CTCTGGTTCT 


GGTGGCAGTA 






2581 


CTTACTATGC 


TGACTCCGTT 


AAAGGTCGCT 


TCACTATCTC 


TAGAGACAAC 


TCTAAGAATA 






2641 


CTCTCTACTT 


GCAGATGAAC 


AGCTTAAGGG 


CTGAGGACAC 


TGCAGTCTAC 


TATTGCGCTA 




10 


2701 


AAGACTATGA 


AGGTACTGGT 


TATGCTTTCG 


ACATATGGGG 


TCAAGGTACT 


ATGGTCACCG 






2761 


TCTCTAGTGC 


CTCCACCAAG 


GGCCCATCGG 


TCTTCCCCCT 


GGCACCCTCC 


TCCAAGAGCA 






2821 


CCTCTGGGGG 


CACAGCGGCC 


CTGGGCTGCC 


TGGTCAAGGA 


CTACTTCCCC 


GAACCGGTGA 






2881 


CGGTGTCGTG 


GAACTCAGGC 


GCCCTGACCA 


GCGGCGTCCA 


CACCTTCCCG 


GCTGTCCTAC 






2941 


AGTCTAGCGG 


ACTCTACTCC 


CTCAGCAGCG 


TAGTGACCGT 


GCCCTCTTCT 


AGCTTGGGCA 




15 


3001 


CCCAGACCTA 


CATCTGCAAC 


GTGAATCACA 


AGCCCAGCAA 


CACCAAGGTG 


GACAAGAAAG 






3061 


TTGAGCCCAA 


ATCTTGTGCG 


GCCGCTCATC 


ACCACCATCA 


TCACTCTGCT 


GAACAAAAAC 


£3 




3121 


TCATCTCAGA 


AGAGGATCTG 


AATGGTGCCG 


CAGATATCAA 


CGATGATCGT 


ATGGCTGGCG 






3181 


CCGCTGAAAC 


TGTTGAAAGT 


TGTTTAGCAA 


AACCCCATAC 


AGAAAATTCA 


TTTACTAACG 


IJ 




3241 


TCTGGAAAGA 


CGACAAAACT 


TTAGATCGTT 


ACGCTAACTA 


TGAGGGTTGT 


CTGTGGAATG 




20 


3301 


CTACAGGCGT 


TGTAGTTTGT 


ACTGGTGACG 


AAACTCAGTG 


TTACGGTACA 


TGGGTTCCTA 


£, J 




3361 


TTGGGCTTGC 


TATCCCTGAA 


AATGAGGGTG 


GTGGCTCTGA 


GGGTGGCGGT 


TCTGAGGGTG 


r nj 




3421 


GCGGTTCTGA 


GGGTGGCGGT 


ACTAAACCTC 


CTGAGTACGG 


TGATACACCT 


ATTCCGGGCT 


□ 




3481 


ATACTTATAT 


CAACCCTCTC 


GACGGCACTT 


ATCCGCCTGG 


TACTGAGCAA 


AACCCCGCTA 






3541 


ATCCTAATCC 


TTCTCTTGAG 


GAGTCTCAGC 


CTCTTAATAC 


TTTCATGTTT 


CAGAATAATA 




25 


3601 


GGTTCCGAAA 


TAGGCAGGGG 


GCATTAACTG 


TTTATACGGG 


CACTGTTACT 


CAAGGCACTG 


i s 




3661 


ACCCCGTTAA 


AACTTATTAC 


CAGTACACTC 


CTGTATCATC 


AAAAGCCATG 


TATGACGCTT 




3721 


ACTGGAACGG 


TAAATTCAGA 


GACTGCGCTT 


TCCATTCTGG 


CTTTAATGAA 


GATCCATTCG 




3781 


TTTGTGAATA 


TCAAGGCCAA 


TCGTCTGACC 


TGCCTCAACC 


TCCTGTCAAT 


GCTGGCGGCG 






3841 


GCTCTGGTGG 


TGGTTCTGGT 


GGCGGCTCTG 


AGGGTGGTGG 


CTCTGAGGGT 


GGCGGTTCTG 




30 


3901 


AGGGTGGCGG 


CTCTGAGGGA 


GGCGGTTCCG 


GTGGTGGCTC 


TGGTTCCGGT 


GATTTTGATT 






3961 


ATGAAAAGAT 


GGCAAACGCT 


AATAAGGGGG 


CTATGACCGA 


AAATGCCGAT 


GAAAACGCGC 






4021 


TACAGTCTGA 


CGCTAAAGGC 


AAACTTGATT 


CTGTCGCTAC 


TGATTACGGT 


GCTGCTATCG 






4081 


ATGGTTTCAT 


TGGTGACGTT 


TCCGGCCTTG 


CTAATGGTAA 


TGGTGCTACT 


GGTGATTTTG 






4141 


CTGGCTCTAA 


TTCCCAAATG 


GCTCAAGTCG 


GTGACGGTGA 


TAATTCACCT 


TTAATGAATA 




35 


4201 


ATTTCCGTCA 


AT AT T T AC CT 


TCCCTCCCTC 


AATCGGTTGA 


ATGTCGCCCT 


TTTGTCTTTA 






4261 


GCGCTGGTAA 


ACCATATGAA 


TTTTCTATTG 


ATTGTGACAA 


AATAAACTTA 


TTCCGTGGTG 






4321 


TCTTTGCGTT 


TCTTTTATAT 


GTTGCCACCT 


TTATGTATGT 


ATTTTCTACG 


TTTGCTAACA 






4381 


TACTGCGTAA 


TAAGGAGTCT 


TAATCATGCC 


AGTTCTTTTG 


GGTATTCCGT 


TATTATTGCG 






4441 


TTTCCTCGGT 


TTCCTTCTGG 


TAACTTTGTT 


CGGCTATCTG 


CTTACTTTTC 


TTAAAAAGGG 







4501 


CTTCGGTAAG 


ATAGCTATTG 


CTATTTCATT 


GTTTCTTGCT 


CTTATTATTG 


GGCTTAACTC 






4561 


AATTCTTGTG 


GGTTATCTCT 


CTGATATTAG 


CGCTCAATTA 


CCCTCTGACT 


TTGTTCAGGG 






4621 


TGTTCAGTTA 


ATTCTCCCGT 


CTAATGCGCT 


TCCCTGTTTT 


TATGTTATTC 


TCTCTGTAAA 






4681 


GGCTGCTATT 


TTCATTTTTG 


ACGTTAAACA 


AAAAATCGTT 


TCTTATTTGG 


ATTGGGATAA 




J 


4741 


ATAATATGGC 


TGTTTATTTT 


GTAACTGGCA 


AATTAGGCTC 


TGGAAAGACG 


CTCGTTAGCG 






4801 


TTGGTAAGAT 


TCAGGATAAA 


ATTGTAGCTG 


GGTGCAAAAT 


AGCAACTAAT 


CTTGATTTAA 






4861 


GGCTTCAAAA 


CCTCCCGCAA 


GTCGGGAGGT 


TCGCTAAAAC 


GCCTCGCGTT 


CTTAGAATAC 






4921 


CGGATAAGCC 


TTCTATATCT 


GATTTGCTTG 


CTATTGGGCG 


CGGTAATGAT 


TCCTACGATG 






4981 


AAAATAAAAA 


CGGCTTGCTT 


GTTCTCGATG 


AGTGCGGTAC 


TTGGTTTAAT 


ACCCGTTCTT 




JO 


5041 


GGAATGATAA 


GGAAAGACAG 


CCGATTATTG 


ATTGGTTTCT 


ACATGCTCGT 


AAATTAGGAT 






5101 


GGGATATTAT 


TTTTCTTGTT 


CAGGACTTAT 


CTATTGTTGA 


TAAACAGGCG 


CGTTCTGCAT 






5161 


TAGCTGAACA 


TGTTGTTTAT 


TGTCGTCGTC 


TGGACAGAAT 


TACTTTACCT 


TTTGTCGGTA 






5221 


CTTTATATTC 


TCTTATTACT 


GGCTCGAAAA 


TGCCTCTGCC 


TAAATTACAT 


GTTGGCGTTG 






5281 


TTAAATATGG 


CGATTCTCAA 


TTAAGCCCTA 


CTGTTGAGCG 


TTGGCTTTAT 


ACTGGTAAGA 




15 


5341 


ATTTGTATAA 


CGCATATGAT 


ACTAAACAGG 


CTTTTTCTAG 


TAATTATGAT 


TCCGGTGTTT 






5401 


ATTCTTATTT 


AACGCCTTAT 


TTATCACACG 


GTCGGTATTT 


CAAACCATTA 


AATTTAGGTC 


[3 




5461 


AGAAGAT GAA 


ATTAACTAAA 


ATATATTTGA 


AAAAGTTTTC 


TCGCGTTCTT 


TGTCTTGCGA 






5521 


TTGGATTTGC 


ATCAGCATTT 


ACATATAGTT 


ATATAACCCA 


ACCTAAGCCG 


GAGGTTAAAA 


3:1 
VJ 




5581 


AGGTAGTCTC 


TCAGACCTAT 


GATTTTGATA 


AATTCACTAT 


TGACTCTTCT 


CAGCGTCTTA 


20 


5641 


ATCTAAGCTA 


TCGCTATGTT 


TTCAAGGATT 


CTAAGGGAAA 


ATTAATTAAT 


AGCGACGATT 


i*i 




5701 


TACAGAAGCA 


AGGTTATTCA 


CTCACATATA 


TTGATTTATG 


TACTGTTTCC 


ATTAAAAAAG 


3. : S 




5761 


GTAATTCAAA 


TGAAATTGTT 


AAATGTAATT 


AATTTTGTTT 


TCTTGATGTT 


TGTTTCATCA 


3 

0 




5821 


TCTTCTTTTG 


CTCAGGTAAT 


TGAAATGAAT 


AATTCGCCTC 


TGCGCGATTT 


TGTAACTTGG 




5881 


TATTCAAAGC 


AATCAGGCGA 


ATCCGTTATT 


GTTTCTCCCG 


ATGTAAAAGG 


TACT GT TACT 




25 


5941 


GTATATTCAT 


CTGACGTTAA 


ACCTGAAAAT 


CTACGCAATT 


TCTTTATTTC 


TGTTTTACGT 


a _ a 




6001 


GCTAATAATT 


TTGATATGGT 


TGGTTCAATT 


CCTTCCATAA 


TTCAGAAGTA 


TAATCCAAAC 


J H 




6061 


AATCAGGATT 


AT ATT GAT GA 


ATTGCCATCA 


TCTGATAATC 


AGGAATATGA 


TGATAATTCC 




6121 


GCTCCTTCTG 


GTGGTTTCTT 


TGTTCCGCAA 


AATGATAATG 


TTACTCAAAC 


TTTTAAAATT 






6181 


AATAACGTTC 


GGGCAAAGGA 


TTTAATACGA 


GTTGTCGAAT 


TGTTTGTAAA 


GTCTAATACT 




30 


6241 


TCTAAATCCT 


CAAATGTATT 


ATCTATTGAC 


GGCTCTAATC 


TATTAGTTGT 


TTCTGCACCT 






6301 


AAAGATATTT 


TAGATAACCT 


TCCTCAATTC 


CTTTCTACTG 


TTGATTTGCC 


AACTGACCAG 






6361 


ATATTGATTG 


AGGGTTTGAT 


ATTTGAGGTT 


CAGCAAGGTG 


ATGCTTTAGA 


TTTTTCATTT 






^6421 


GCTGCTGGCT 


CTCAGCGTGG 


CACTGTTGCA 


GGCGGTGTTA 


ATACTGACCG 


CCTCACCTCT 






6481 


GTTTTATCTT 


CTGCTGGTGG 


TTCGTTCGGT 


ATTTTTAATG 


GCGATGTTTT 


AGGGCTATCA 




35 


6541 


GTTCGCGCAT 


TAAAGACTAA 


TAGCCATTCA 


AAAATATTGT 


CTGTGCCACG 


TATTCTTACG 






6601 


CTTTCAGGTC 


AGAAGGGTTC 


TATCTCTGTT 


GGCCAGAATG 


TCCCTTTTAT 


TACTGGTCGT 






6661 


GTGACTGGTG 


AATCTGCCAA 


TGTAAATAAT 


CCATTTCAGA 


CGATTGAGCG 


TCAAAATGTA 






6721 


GGTATTTCCA 


TGAGCGTTTT 


TCCTGTTGCA 


ATGGCTGGCG 


GTAATATTGT 


TCTGGATATT 






6781 


ACCAGCAAGG 


CCGATAGTTT 


GAGTTCTTCT 


ACTCAGGCAA 


GTGATGTTAT 


TACTAATCAA 








6841 


AGAAGTATTG 


CTACAACGGT 


TAATTTGCGT 


GATGGACAGA 


CTCTTTTACT 


CGGTGGCCTC 






6901 


ACT GAT TATA 


AAAACACTTC 


TCAAGATTCT 


GGCGTACCGT 


TCCTGTCTAA 


AATCCCTTTA 






6961 


ATCGGCCTCC 


TGTTTAGCTC 


CCGCTCTGAT 


TCCAACGAGG 


AAAGCACGTT 


ATACGTGCTC 






7021 


GTCAAAGCAA 


CCATAGTACG 


CGCCCTGTAG 


CGGCGCATTA 


AGCGCGGCGG 


GTGTGGTGGT 




5 


7081 


TACGCGCAGC 


GTGACCGCTA 


CACTTGCCAG 


CGCCCTAGCG 


CCCGCTCCTT 


TCGCTTTCTT 






7141 


CCCTTCCTTT 


CTCGCCACGT 


TCGCCGGCTT 


TCCCCGTCAA 


GCTCTAAATC 


GGGGGCTCCC 






7201 


TTTAGGGTTC 


CGATTTAGTG 


CTTTACGGCA 


CCTCGACCCC 


AAAAAACTTG. ATTTGGGTGA 






7261 


TGGTTCACGT 


AGTGGGCCAT 


CGCCCTGATA 


GACGGTTTTT 


CGCCCTTTGA 


CGTTGGAGTC 






7321 


CACGTTCTTT 


AATAGTGGAC 


TCTTGTTCCA 


AACTGGAACA 


ACACTCAACC 


CTATCTCGGG 




JO 


7381 


CTATTCTTTT 


GATTTATAAG 


GGATTTTGCC 


GATTTCGGAA 


CCACCATCAA 


ACAGGATTTT 






7441 


CGCCTGCTGG 


GGCAAACCAG 


CGTGGACCGC 


TTGCTGCAAC 


TCTCTCAGGG 


CCAGGCGGTG 






7501 


AAGGGCAATC 


AGCTGTTGCC 


CGTCTCACTG 


GTGAAAAGAA 


AAACCACCCT 


GGATCCAAGC 






7561 


TTGCAGGTGG 


CACTTTTCGG 


GGAAATGTGC 


GCGGAACCCC 


TATTTGTTTA 


TTTTTCTAAA 






7621 


TACATTCAAA 


TATGTATCCG 


CTCATGAGAC 


AATAACCCTG 


ATAAATGCTT 


CAATAATATT 




15 


7681 


GAAAAAGGAA 


GAGTATGAGT 


ATTCAACATT 


TCCGTGTCGC 


CCTTATTCCC 


TTTTTTGCGG 


p 




7741 


CATTTTGCCT 


TCCTGTTTTT 


GCTCACCCAG 


AAACGCTGGT 


GAAAGTAAAA 


GATGCTGAAG 




7801 


ATCAGTTGGG 


CGCACGAGTG 


GGTTACATCG 


AACTGGATCT 


CAACAGCGGT 


AAGATCCTTG 


CO 




7861 


AGAGTTTTCG 


CCCCGAAGAA 


CGTTTTCCAA 


TGATGAGCAC 


TTTTAAAGTT 


CTGCTATGTC 






7921 


ATACACTATT 


ATCCCGTATT 


GACGCCGGGC 


AAGAGCAACT 


CGGTCGCCGG 


GCGCGGTATT 


f i 

*s =■ 


20 


7981 


CTCAGAATGA 


CTTGGTTGAG 


TACTCACCAG 


TCACAGAAAA 


GCATCTTACG 


GATGGCATGA 




8041 


CAGTAAGAGA 


ATTATGCAGT 


GCTGCCATAA 


CCATGAGTGA 


TAACACTGCG 


GCCAACTTAC 


a, : s 




8101 


TTCTGACAAC 


GATCGGAGGA 


CCGAAGGAGC 


TAACCGCTTT 


TTTGCACAAC 


ATGGGGGATC 






8161 


ATGTAACTCG 


CCTTGATCGT 


TGGGAACCGG 


AGCTGAATGA 


AGCCATACCA 


AACGACGAGC 


O 




8221 


GTGACACCAC 


GATGCCTGTA 


GCAATGCCAA 


CAACGTTGCG 


CAAACTATTA ACTGGCGAAC 




25 


8281 


TACTTACTCT 


AGCTTCCCGG 


CAACAATTAA 


TAGACTGGAT 


GGAGGCGGAT 


AAAGTTGCAG 






8341 


GACCACTTCT 


GCGCTCGGCC 


CTTCCGGCTG 


GCTGGTTTAT 


T GCT GAT AAA 


TCTGGAGCCG 






8401 


GTGAGCGTGG 


GTCTCGCGGT 


ATCATTGCAG 


CACTGGGGCC 


AGATGGTAAG 


CCCTCCCGTA 






8461 


TCGTAGTTAT 


CTACACGACG 


GGGAGTCAGG 


CAACTATGGA 


TGAACGAAAT 


AGACAGATCG 






8521 


CTGAGATAGG 


TGCCTCACTG 


ATTAAGCATT 


GGTAACTGTC 


AGACCAAGTT 


TACTCATATA 




30 


8581 


TACTTTAGAT 


TGATTTAAAA 


CTTCATTTTT 


AATTTAAAAG 


GATCTAGGTG 


AAGATCCTTT 






8641 


TTGATAATCT 


CATGACCAAA 


ATCCCTTAAC 


GTGAGTTTTC 


GTTCCACTGT 


ACGTAAGACC 






8701 


CCCAAGCTTG 


TCGACTGAAT 


GGCGAATGGC 


GCTTTGCCTG 


GTTTCCGGCA 


CCAGAAGCGG 






8761 


TGCCGGAAAG 


CTGGCTGGAG 


TGCGATCTTC 


CTGAGGCCGA 


TACTGTCGTC 


GTCCCCTCAA 






8821 


ACTGGCAGAT 


GCACGGTTAC 


GATGCGCCCA 


TCTACACCAA 


CGTAACCTAT 


CCCATTACGG 




35 


8881 


TCAATCCGCC 


GTTTGTTCCC 


ACGGAGAATC 


CGACGGGTTG 


TTACTCGCTC 


ACATTTAATG 






8941 


TTGATGAAAG 


CTGGCTACAG 


GAAGGCCAGA 


CGCGAATTAT 


TTTTGATGGC 


GTTCCTATTG 






9001 


GTTAAAAAAT 


GAGCTGATTT 


AACAAAAATT 


TAACGCGAAT 


TTTAACAAAA 


TATTAACGTT 






9061 


TACAATTTAA 


ATATTTGCTT 


ATACAATCTT 


CCTGTTTTTG 


GGGCTTTTCT 


GATTATCAAC 






9121 


CGGGGTACAT 


ATGATTGACA 


TGCTAGTTTT 


ACGATTACCG 


TTCATCGATT 


CTCTTGTTTG 



9181 CTCCAGACTC TCAGGCAATG ACCTGATAGC 
9241 CTCCGGCATG AATTTATCAG CTAGAACGGT 
9301 CTCCGGCCTT TCTCACCCTT TTGAATCTTT 
9361 AATATATGAG GGTTCTAAAA ATTTTTATCC 
9421 AGTATTACAG GGTCATAATG TTTTTGGTAC 
94 81 ATTGCTTAAT TTTGCTAATT CTTTGCCTTG 



CTTTGTAGAT CTCTCAAAAA TAGCTACCCT 
TGAATATCAT ATTGATGGTG ATTTGACTGT 
ACCTACACAT TACTCAGGCA TTGCATTTAA 
TTGCGTTGAA ATAAAGGCTT CTCCCGCAAA 
AACCGATTTA GCTTTATGCT CTGAGGCTTT 
CCTGTATGAT TTATTGGATG TT 



Table 200: Enzymes that either cut 15 or more human GLGs or have 5+-base recognition in FR3 
Typical entry: 

REname Recognition #sites 

GLGidtf : base# GLGidtf : base# GLGid# : base# 



BstEII Ggtnacc 2 
1: 3 48: 3 
There are 2 hits at base# 3 



10 Maelll gtnac 36 





1: 


4 


2 : 


4 


3 : 


4 


4 : 


4 


5 : 


4 


6 : 


4 




7: 


4 


8 : 


4 


9 : 


4 


10 : 


4 


11 : 


4 


37 : 


4 




37: 


58 


38: 


4 


38: 


58 


39: 


4 


39: 


58 


40: 


4 




40: 


58 


41: 


4 


41: 


58 


42: 


4 


42: 


58 


43: 


4 


ID 


43: 


58 


44: 


4 


44 : 


c o 

58 


AC. 

45 : 


4 


AC. 

45 : 


c o 

58 


A C . 

46 : 


4 




46: 


58 


47: 


4 


47 : 


58 


48 : 


4 


49 : 


4 


50 : 


58 




There 


are 24 


hits 


at 


basel 


4 














Tsp45I 


gtsac 










33 










20 


1: 


4 


2: 


4 


3: 


4 


4: 


4 


5: 


4 


6: 


4 




7: 


4 


8: 


4 


9: 


4 


10: 


4 


11: 


4 


37: 


4 




37: 


58 


38: 


4 


38: 


58 


39: 


58 


40: 


4 


40: 


58 




41: 


58 


42: 


58 


43: 


4 


43: 


58 


44: 


4 


44: 


58 




45: 


4 


45: 


58 


46: 


4 


46: 


58 


47: 


4 


47: 


58 


25 


48: 


4 


49: 


4 


50: 


58 
















There 


are 21 


hits 


at 


baset 


4 














HphI 


tcacc 










45 












1: 


5 


2: 


5 


3: 


5 


4: 


5 


5: 


5 


6: 


5 


30 


7: 


5 


8: 


5 


11: 


5 


12: 


5 


12: 


11 


13: 


5 




14: 


5 


15: 


5 


16: 


5 


17: 


5 


18: 


5 


19: 


5 




20: 


5 


21: 


5 


22: 


5 


23: 


5 


24: 


5 


25: 


5 




26: 


5 


27: 


5 


28: 


5 


29: 


5 


30: 


5 


31: 


5 




32: 


5 


33: 


5 


34: 


5 


35: 


5 


36: 


5 


37: 


5 


35 


38: 


5 


40: 


5 


43: 


5 


44: 


5 


45: 


5 


46: 


5 




47: 


5 


48: 


5 


49: 


5 















There are 44 hits at base# 5 



Nlalll CATG 26 

1: 9 1: 42 2: 42 3: 9 3: 42 4: 9 

4: 42 5: 9 5: 42 6: 42 6: 78 7: 9 

7: 42 8: 21 8: 42 9: 42 10: 42 11: 42 

12: 57 13: 48 13: 57 14: 57 31: 72 38: 9 

48: 78 49: 78 

There are 11 hits at base# 42 

There are 1 hits at base# 48 Could cause raggedness 



BsaJI Ccnngg 










37 










1: 14 


2: 


14 


5: 


14 


6: 


14 


7: 


14 


8: 


14 


8: 65 


9: 


14 


10: 


14 


11: 


14 


12: 


14 


13: 


14 


14: 14 


15: 


65 


17: 


14 


17: 


65 


18: 


65 


19: 


65 


20: 65 


21: 


65 


22: 


65 


26: 


65 


29: 


65 


30: 


65 


33: 65 


34: 


65 


35: 


65 


37: 


65 


38: 


65 


39: 


65 


40: 65 


42: 


65 


43: 


65 


48: 


65 


49: 


65 


50: 


65 


51: 14 























There are 23 hits at base# 65 

There are 14 hits at base# 14 



Alul AGct 



1: 


47 


2: 


47 


7: 


47 


8: 


47 


23: 


63 


24: 


63 


37: 


47 


37: 


52 


40: 


47 


40: 


52 


43: 


47 


43: 


52 



42 



3: 


47 


4: 


47 


9: 


47 


10: 


47 


25: 


63 


31: 


63 


38: 


47 


38: 


52 


41: 


47 


41: 


52 


44: 


47 


44: 


52 



5: 


47 


6: 


47 


11: 


47 


16: 


63 


32: 


63 


36: 


63 


39: 


47 


39: 


52 


42: 


47 


42: 


52 


45: 


47 


45: 


52 



46: 47 46: 52 47: 47 47: 52 49: 15 50: 47 
There are 23 hits at base# 47 

There are 11 hits at base# 52 Only 5 bases from 47 



BlpI GCtnagc 21 

1: 48 2: 48 3: 48 5: 48 6: 48 7: 48 
8: 48 9: 48 10: 48 11: 48 37: 48 38: 48 

39: 48 40: 48 41: 48 42: 48 43: 48 44: 48 

45: 48 46: 48 47: 48 

There are 21 hits at base# 48 



Mwol GCNNNNNnngc 19 

1: 48 2: 28 19: 36 22: 36 23: 36 24: 36 
25: 36 26: 36 35: 36 37: 67 39: 67 40: 67 
41: 67 42: 67 43: 67 44: 67 45: 67 46: 67 
5 47: 67 

There are 10 hits at base# 67 
There are 7 hits at base# 36 



Ddel Ctnag 71 

10 1: 49 1: 58 2: 49 2: 58 3: 49 3: 58 

3: 65 4: 49 4: 58 5: 49 5: 58 5: 65 

6: 49 6: 58 6: 65 7: 49 7: 58 7: 65 

8: 49 8: 58 9: 49 9: 58 9: 65 10: 49 

10: 58 10: 65 11: 49 11: 58 11: 65 15: 58 

15 16: 58 16: 65 17: 58 18: 58 20: 58 21: 58 

22: 58 23: 58 23: 65 24: 58 24: 65 25: 58 



25 



30 



25: 65 26: 58 27: 58 27: 65 28: 58 30: 58 
31: 58 31: 65 32: 58 32: 65 35: 58 36: 58 



36: 65 37: 49 38: 49 39: 26 39: 49 40: 49 

20 41: 49 42: 26 42: 49 43: 49 44: 49 45: 49 

46: 49 47: 49 48: 12 49: 12 51: 65 
There are 29 hits at base# 58 

There are 22 hits at baseffi 49 Only nine base from 58 

There are 16 hits at base# 65 Only seven bases from 58 



Bglll Agatct 11 

1: 61 2: 61 3: 61 4: 61 5: 61 6: 61 
7: 61 9: 61 10: 61 11: 61 51: 47 

There are 10 hits at base# 61 

BstYI Rgatcy 12 

1: 61 2: 61 3: 61 4: 61 5: 61 6: 61 

7: 61 8: 61 9: 61 10: 61 11: 61 51: 47 

There are 11 hits at base# 61 



35 




Hpyl88l TCNga 17 

1: 64 2: 64 3: 64 4: 64 5: 64 6: 64 

7: 64 8: 64 9: 64 10: 64 11: 64 16: 57 

20: 57 27: 57 35: 57 48: 67 49: 67 

5 There are 11 hits at base# 64 

There are 4 hits at base# 57 

There are 2 hits at base# 67 Could be ragged. 



MslI CAYNNnnRTG 44 



s 

13 



JO 



1: 


72 


2: 


72 


3: 


72 


4: 


72 


5: 


72 


6: 


72 


7: 


72 


8: 


72 


9: 


72 


10: 


72 


11: 


72 


15: 


72 


17: 


72 


18: 


72 


19: 


72 


21: 


72 


23: 


72 


24: 


72 


25: 


72 


26: 


72 


28: 


72 


29: 


72 


30: 


72 


31: 


72 


32: 


72 


33: 


72 


34: 


72 


35: 


72 


36: 


72 


37: 


72 


38: 


72 


39: 


72 


40: 


72 


41: 


72 


42: 


72 


43: 


72 


44: 


72 


45: 


72 


46: 


72 


47: 


72 


48: 


72 


49: 


72 


50: 


72 


51: 


72 



















15 

£3 

%y ■ 

U There are 44 hits at base# 72 



p 20 BsiEI CGRYcg 23 



74 
74 
74 



1: 74 3: 74 4: 74 5: 74 7: 74 8 

9: 74 10: 74 11: 74 17: 74 22: 74 30 

*S '. 33: 74 34: 74 37: 74 38: 74 39: 74 40 

I* 41: 74 42: 74 45: 74 46: 74 47: 74 
p 25 There are 23 hits at base# 74 

** 

Eael Yggccr 23 

1: 74 3: 74 4: 74 5: 74 7: 74 8: 74 

9: 74 10: 74 11: 74 17: 74 22: 74 30: 74 

30 33: 74 34: 74 37: 74 38: 74 39: 74 40: 74 

41: 74 42: 74 45: 74 46: 74 47: 74 
There are 23 hits at base# 74 



35 



EagX Cggccg 23 
1: 74 3: 74 4: 74 5: 74 
9: 74 10: 74 11: 74 17: 74 



7: 74 8: 74 
22: 74 30: 74 



33: 74 34: 74 37: 74 38: 74 39: 74 40: 74 
41: 74 42: 74 45: 74 46: 74 47: 74 
There are 23 hits at base# 74 




5 Haelll GGcc 27 



10 



1: 


75 


3: 


75 


4: 


75 


5: 


75 


7: 


75 


8: 


75 


9: 


75 


10: 


75 


11: 


75 


16: 


75 


17: 


75 


20: 


75 


22: 


75 


30: 


75 


33: 


75 


34: 


75 


37: 


75 


38: 


75 


39: 


75 


40: 


75 


41: 


75 


42: 


75 


45: 


75 


46: 


75 


47: 


75 


48: 


63 


49: 


63 















There are 25 hits at base# 75 



Bst4CI ACNgt 65°C 







1: 


86 


2: 


86 




15 


7: 


34 


7: 


86 


s a 




12: 


86 


13: 


86 






16: 


86 


17: 


36 


|J. 




20: 


86 


21: 


36 


i 

"-■s: 

IJ 




24: 


86 


25: 


86 




20 


28: 


86 


29: 


86 






33: 


86 


34: 


86 


f *% 




38: 


86 


39: 


86 






44: 


86 


45: 


86 






50: 


86 


51: 


0 



fl 25 There are 51 



63 Sites There 



3: 


86 


4: 


86 


5: 


8: 


86 


9: 


86 


10: 


14: 


86 


15: 


36 


15: 


17: 


86 


18: 


86 


19: 


21: 


86 


22: 


0 


22: 


26: 


86 


27: 


53 


27: 


30: 


86 


31: 


86 


32: 


35: 


53 


35: 


86 


36: 


40: 


86 


41: 


86 


42: 


46: 


86 


47: 


86 


48: 


51: 


86 








at 


base* 86 


All 


the 



a third isoschismer 



86 


6: 


86 


86 


11: 


86 


86 


16: 


53 


86 


20: 


53 


86 


23: 


86 


86 


28: 


36 


86 


33: 


36 


86 


37: 


86 


86 


43: 


86 


86 


49: 


86 



r sites are well away 



HpyCH4III ACNgt 63 



1: 


86 


2: 


86 


3: 


86 


4: 


86 


5: 


86 


6: 


86 


7: 


34 


7: 


86 


8: 


86 


9: 


86 


10: 


86 


11: 


86 


12: 


86 


13: 


86 


14: 


86 


15: 


36 


15: 


86 


16: 


53 


16: 


86 


17: 


36 


17: 


86 


18: 


86 


19: 


86 


20: 


53 


20: 


86 


21: 


36 


21: 


86 


22: 


0 


22: 


86 


23: 


86 


24: 


86 


25: 


86 


26: 


86 


27: 


53 


27: 


86 


28: 


36 


28: 


86 


29: 


86 


30: 


86 


31: 


86 


32: 


86 


33: 


36 


33: 


86 


34: 


86 


35: 


53 


35: 


86 


36: 


86 


37: 


86 


38: 


86 


39: 


86 


40: 


86 


41: 


86 


42: 


86 


43: 


86 



44: 86 45: 86 46: 86 
50: 86 51: 0 51: 86 
There are 51 hits at base# 86 



47: 86 48: 86 



49: 86 



Hinf I Gantc 



43 



V 3 

W 

pas. 



5 



10 



15 



20 



35 



2: 


2 


3: 


2 


4: 


2 


5: 


2 


6: 


2 


7: 


2 


8: 


2 


9: 


2 


9: 


22 


10: 


2 


11: 


2 


15: 


2 


16: 


2 


17: 


2 


18: 


2 


19: 


2 


19: 


22 


20: 


2 


21: 


2 


23: 


2 


24: 


2 


25: 


2 


26: 


2 


27: 


2 


28: 


2 


29: 


2 


30: 


2 


31: 


2 


32: 


2 


33: 


2 


33: 


22 


34: 


22 


35: 


2 


36: 


2 


37: 


2 


38: 


2 


40: 


2 


43: 


2 


44: 


2 


45: 


2 


46: 


2 


47: 


2 


50: 


60 























There are 38 hits at base# 2 

Mlyl GAGTCNNNNNn 



18 



There are 18 hits at base# 2 
Pie I gagtc 



18 



There are 
There are 

There are 
There are 
There are 
There are 



8 hits at 
8 hits at 

3 hits at 
3 hits at 
1 hits at 
1 hits at 



base# 62 
base# 65 

base# 14 
base# 74 
base# 26 
base# 35 



2: 


2 


3: 


2 


4: 2 


5: 


2 


6: 


2 


7: 


2 


8: 


2 


9: 


2 


10: 2 


11: 


2 


37: 


2 


38: 


2 


40: 


2 


43: 


2 


44: 2 


45: 


2 


46: 


2 


47: 


2 







2: 


2 


3: 


2 


4: 


2 


5: 


2 


6: 


2 


7: 


2 


M 




8: 


2 


9: 


2 


10: 


2 


11: 


2 


37: 


2 


38: 


2 


f 1 


25 


40: 


2 


43: 


2 


44: 


2 


45: 


2 


46: 


2 


47: 


2 


f ==> 




There are 18 


hits 


at 


baseH 


2 
















Acil 


Ccgc 












24 














2: 


26 


9: 


14 


10: 


14 


11: 


14 


27: 


74 


37: 


62 






37: 


65 


38: 


62 


39: 


65 


40: 


62 


40: 


65 


41: 


65 




30 


42: 


65 


43: 


62 


43: 


65 


44: 


62 


44: 


65 


45: 


62 






46: 


62 


47: 


62 


47: 


65 


48: 


35 


48: 


74 


49: 


74 



-"- Gcgg 11 

8: 91 9: 16 10: 16 11: 16 
40: 67 42: 67 43: 67 45: 67 
There are 7 hits at base# 67 

There are 3 hits at base# 16 
There are 1 hits at base# 91 



37: 67 
46: 67 



39: 67 



BsiHKAI GWGCWc 



20 





2: 


30 


4: 


30 


6: 


30 


7: 


30 


9: 


30 


10: 


30 


10 


12: 


89 


13: 


89 


14: 


89 


37: 


51 


38: 


51 


39: 


51 




40: 


51 


41: 


51 


42: 


51 


43: 


51 


44: 


51 


45: 


51 




46: 


51 


47: 


51 



















There are 11 hits at base# 51 



15 Bspl286I GDGCHc 



20 



f i 




2: 30 


4: 30 


6: 


30 


7 : 


30 


9 : 


30 


10 : 


30 


•9 !=' 

i"=fc 
1 37 




12: 89 


13: 89 


14: 


89 


37: 


51 


38: 


51 


39: 


51 


r r= 

u 

W 




40: 51 
46: 51 


41: 51 
47: 51 


42: 


51 


43: 


51 


44: 


51 


45: 


51 


20 


There are 


11 hits 


at 


basefl 


51 












T '■ ' 




HgiAI GWGCWc 








20 










S, 3S. 
IJ 




2: 30 


4: 30 


6: 


30 


7: 


30 


9: 


30 


10: 


30 






12: 89 


13: 89 


14: 


89 


37: 


51 


38: 


51 


39: 


51 


3 : 

? —■ 

S.l 


25 


40: 51 


41: 51 


42: 


51 


43: 


51 


44: 


51 


45: 


51 


12 




46: 51 


47: 51 






















There are 


11 hits 


at 


base# 


51 
















BsoFI GCngc 








26 












30 


2: 53 


3: 53 


5: 


53 


6: 


53 


7: 


53 


8: 


53 






8: 91 


9: 53 


10: 


53 


11: 


53 


31: 


53 


36: 


36 






37: 64 


39: 64 


40: 


64 


41: 


64 


42: 


64 


43: 


64 






44: 64 


45: 64 


46: 


64 


47: 


64 


48: 


53 


49: 


53 






50: 45 


51: 53 




















35 


There are 


13 hits 


at 


base# 


53 













There are 10 hits at base# 64 



Tsel Gcwgc 17 
2: 53 3: 53 5: 53 6: 53 



7: 53 



8: 



53 



# 

9: 53 10: 53 11: 53 31: 53 36: 36 45: 64 
46: 64 48: 53 49: 53 50: 45 51: 53 
There are 13 hits at base# 53 




5 Mnll gagg 



10 



3: 


67 


3: 


95 


4: 


7: 


67 


8: 


67 


9: 


16: 


67 


17: 


67 


19: 


23: 


67 


24: 


67 


25: 


29: 


67 


30: 


67 


31: 


35: 


67 


36: 


67 


50: 



There are 31 hits at 
HpyCH4V TGca 



5: 


90 


6: 


90 


11: 


15: 


44 


16: 


44 


16: 


20: 


44 


21: 


44 


22: 


26: 


44 


27: 


44 


27: 


34: 


44 


35: 


44 


35: 


50: 


44 


50: 


90 


51: 



a There are 21 hits at 

^ There are 1 hits at 



34 



51 5: 


16 


5: 


67 


6: 


67 


67 10: 


67 


11: 


67 


15: 


67 


67 20: 


67 


21: 


67 


22: 


67 


67 26: 


67 


27: 


67 


28: 


67 


67 32: 


67 


33: 


67 


34: 


67 


67 51: 


67 










base# 67 














34 










90 12: 


90 


13: 


90 


14: 


90 


90 17: 


44 


18: 


90 


19: 


44 


44 23: 


44 


24: 


44 


25: 


44 


90 28: 


44 


29: 


44 


33: 


44 


90 36: 


38 


48: 


44 


49: 


44 


44 51: 


52 











base# 44 

base# 52 



*tj AccI GTmkac 13 5-base recognition 

£3. 25 7: 37 11: 24 37: 16 38: 16 39: 16 40: 16 

* fh 41: 16 42: 16 43: 16 44: 16 45: 16 46: 16 

47: 16 

There are 11 hits at base# 16 

30 SacII CCGCgg 8 6-base recognition 

9: 14 10: 14 11: 14 37: 65 39: 65 40: 65 
42: 65 43: 65 

There are 5 hits at base# 65 
There are 3 hits at base# 14 



35 



Tfil Gawtc 24 

9: 22 15: 2 16: 2 17: 2 18: 2 19: 2 
19: 22 20: 2 21: 2 23: 2 24: 2 25: 2 



26: 2 27: 2 28: 2 29: 2 30: 2 31: 2 
32: 2 33: 2 33: 22 34: 22 35: 2 36: 2 
There are 20 hits at base# 2 

5 BsmAI Nnnnnngagac 19 

15: 11 16: 11 20: 11 21: 11 22: 11 23: 11 
24: 11 25: 11 26: 11 27: 11 28: 11 28: 56 
30: 11 31: 11 32: 11 35: 11 36: 11 44: 87 
48: 87 

JO There are 16 hits at base# 11 

Bpml ctccag 19 

15: 12 16: 12 17: 12 18: 12 20: 12 21: 12 

22: 12 23: 12 24: 12 25: 12 26: 12 27: 12 

75 28: 12 30: 12 31: 12 32: 12 34: 12 35: 12 

36: 12 

|3 

%3 There are 19 hits at base# 12 

? ft 

y . 

%j XmnI GAANNnnttc 12 

|j 20 37: 30 38: 30 39: 30 40: 30 41: 30 42: 30 

O 43: 30 44: 30 45: 30 46: 30 47: 30 50: 30 

* There are 12 hits at base# 30 



*~ BsrI NCcagt 12 

I* 25 37: 32 38: 32 39: 32 40: 32 41: 32 42: 32 

[1 43: 32 44: 32 45: 32 46: 32 47: 32 50: 32 

,U There are 12 hits at base# 32 



Banll GRGCYc 11 
30 37: 51 38: 51 39: 51 40: 51 41: 51 42: 51 
43: 51 44: 51 45: 51 46: 51 47: 51 
There are 11 hits at base# 51 

Ecll36I GAGctc 11 

35 37: 51 38: 51 39: 51 40: 51 41: 51 42: 51 

43: 51 44: 51 45: 51 46: 51 47: 51 

There are 11 hits at base# 51 



SacI GAGCTc 



11 




37: 51 38: 51 39: 51 40: 51 41: 51 42: 51 
43: 51 44: 51 45: 51 46: 51 47: 51 
There are 11 hits at base# 51 



□ 

m 
w 

M. 
iy 
ea- 
rn. 

5 

13 

i* 

Q 



Table 217: Human HC GLG FR1 Sequences 

VH Exon - Nucleotide sequence alignment 

VH1 



1- 


-02 


CAG 


GTG 


CAG 


CTG 


GTG 


CAG 


TCT 


GGG 


GCT 






GTC 


TCC 


TGC 


AAG 


GCT 


TCT 


GGA 


TAC 


ACC 


1- 


■03 


cag 


gtC 


cag 


ctT 


gtg 


cag 


tct 


ggg 


get 






gtT 


tec 


tgc 


aag 


get 


tct 


gga 


tac 


acc 


1- 


-08 


cag 


gtg 


cag 


ctg 


gtg 


cag 


tct 


ggg 


get 






gtc 


tec 


tgc 


aag 


get 


tct 


gga 


tac 


acc 


1- 


■18 


cag 


gtT 


cag 


ctg 


gtg 


cag 


tct 


ggA 


get 






gtc 


tec 


tgc 


aag 


get 


tct 


ggT 


tac 


acc 


1- 


24 


cag 


gtC 


cag 


ctg 


gtA 


cag 


tct 


ggg 


get 






gtc 


tec 


tgc 


aag 


gTt 


tcC 


gga 


tac 


acc 


1- 


■45 


cag Atg 


cag 


ctg 


gtg 


cag 


tct 


ggg 


get 






gtT 


tec 


tgc 


aag 


get 


tcC 


gga 


tac 


acc 


1- 


46 


cag 


gtg 


cag 


ctg 


gtg 


cag 


tct 


ggg 


get 






gtT 


tec 


tgc 


aag 


gcA 


tct 


gga 


tac 


acc 


1- 


■58 


caA Atg 


cag 


ctg 


gtg 


cag 


tct 


ggg 


Cct 






gtc 


tec 


tgc 


aag 


get 


tct 


gga 


tTc 


acc 


1- 


■69 


cag 


gtg 


cag 


ctg 


gtg 


cag 


tct 


ggg 


get 






gtc 


tec 


tgc 


aag 


get 


tct 


gga 


GGc 


acc 


1- 


e 


cag 


gtg 


cag 


ctg 


gtg 


cag 


tct 


ggg 


get 






gtc 


tec 


tgc 


aag 


get 


tct 


gga 


GGc 


acc 


1- 


f 


Gag 


gtC 


cag 


ctg 


gtA 


cag 


tct 


ggg 


get 






Ate 


tec 


tgc 


aag 


gTt 


tct 


gga 


tac 


acc 


VH2 




















2- 


05 


CAG 


ATC 


ACC 


TTG 


AAG 


GAG 


TCT 


GGT 


CCT 






CTG 


ACC 


TGC 


ACC 


TTC 


TCT 


GGG 


TTC 


TCA 


2- 


26 


cag 


Gtc 


acc 


ttg 


aag 


gag 


tct 


ggt 


cct 






ctg 


ace 


tgc 


acc 


Gtc 


tct 


ggg 


ttc 


tea 


2- 


70 


cag 


Gtc 


acc 


ttg 


aag 


gag 


tct 


ggt 


cct 






ctg 


ace 


tgc 


acc 


ttc 


tct 


ggg 


ttc 


tea 


VH3 




















3- 


07 


GAG 


GTG 


CAG 


CTG 


GTG 


GAG 


TCT 


GGG 


GGA 






CTC 


TCC 


TGT 


GCA 


GCC 


TCT 


GGA 


TTC 


ACC 


3- 


09 


gaA gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 






etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


3- 


11 


Cag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 






etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


3- 


13 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 






etc 


tec 


tgt 


gca 


gec 


tct 


gga 


ttc 


acc 


3- 


15 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 






etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acT 


3- 


20 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 



GAG GTG AAG AAG CCT GGG GCC TCA GTG AAG 
TTC ACC 

gag gtg aag aag cct ggg gec tea gtg aag 
ttc acT 

gag gtg aag aag cct ggg gec tea gtg aag 
ttc acc 

gag gtg aag aag cct ggg gee tea gtg aag 
ttT acc 

gag gtg aag aag cct ggg gee tea gtg aag 
Ctc acT 

gag gtg aag aag Act ggg Tec tea gtg aag 
ttc acc 

gag gtg aag aag cct ggg gec tea gtg aag 
ttc acc 

gag gtg aag aag cct ggg Acc tea gtg aag 
ttT acT 

gag gtg aag aag cct ggg Tec tcG gtg aag 
ttc aGc 

gag gtg aag aag cct ggg Tec tcG gtg aag 
ttc aGc 

gag gtg aag aag cct ggg gcT Aca gtg aaA 
ttc acc 

ACG CTG GTG AAA CCC ACA CAG ACC CTC ACG 
CTC AGC 

GTg ctg gtg aaa ccc aca Gag acc ctc acg 
ctc age 

Gcg ctg gtg aaa ccc aca cag acc ctc acA 
ctc age 

GGC TTG GTC CAG CCT GGG GGG TCC CTG AGA 
TTT AGT 

ggc ttg gtA cag cct ggC Agg tec ctg aga 
ttt GAt 

ggc ttg gtc Aag cct ggA ggg tec ctg aga 
ttc agt 

ggc ttg gtA cag cct ggg ggg tec ctg aga 
ttC agt 

ggc ttg gtA Aag cct ggg ggg tec ctT aga 
ttc agt 

ggT Gtg gtA cGg cct ggg ggg tec ctg aga 











etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


ttt 


GAt 






















3- 


■21 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


Ctg 


gtc Aag 


cct 


ggg 


ggg 


tec 


ctg 


aga 










etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


ttc 


agt 






















3- 


-23 


gag 


gtg 


cag 


ctg 


Ttg 


gag 


tct 


ggg 


gga 


ggc 


ttg 


gtA 


cag 


cct 


ggg 


ggg 


tec 


ctg 


aga 




J 






etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


ttt 


agC 






















3- 


-30 


Cag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


Gtg 


gtc 


cag 


cct 


ggg Agg 


tec 


ctg 


aga 










etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


ttc 


agt 






















3- 


•30 . 


. 3 Cag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


Gtg 


gtc 


cag 


cct 


ggg Agg 


tec 


ctg 


aga 










etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


ttc 


agt 




















IU 


3- 


■30 . 


, 5 Cag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


Gtg 


gtc 


cag 


cct 


ggg Agg 


tec 


ctg 


aga 










etc 


tec 


tgt 


gca 


gec 


tct 


gga 


ttc 


acc 


ttc 


agt 






















3- 


•33 


Cag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


Gtg 


gtc 


cag 


cct 


ggg Agg 


tec 


ctg 


aga 










etc 


tec 


tgt 


gca 


gcG 


tct 


gga 


ttc 


acc 


ttc 


agt 






















3- 


■43 


gaA 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


gTc 


Gtg 


gtA 


cag 


cct 


ggg 


ggg 


tec 


ctg 


aga 




1 c 

u 






etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


ttt 


GAt 






















3- 


■48 


gag 
etc 


gtg 
tec 


cag 
tgt 


ctg 
gca 


gtg 
gee 


gag 
tct 


tct 
gga 


ggg 

ttc 


gga 
acc 


ggc 

ttc 


ttg 
agt 




cag 


cct 


ggg 


ggg 


tec 


ctg 


aga 






3- 


■49 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


ttg 




cag 


ccA 


ggg 


Cgg 


tec 


ctg 


aga 


ti ss- 








etc 


tec 


tgt Aca 


gcT 


tct 


gga 


ttc 


acc 


ttt 


Ggt 




















in 


3- 


53 


gag 


gtg 


cag 


ctg 


gtg 


gag 


Act 


ggA 


gga 


ggc 


ttg 


7A +• 

AtC 


cag 


cct 


ggg 


ggg 


tec 


ctg 


aga 


| - | 








etc 


tec 


tgt 


gca 


gee 


tct 


ggG 


ttc 


acc 


GtC 


agt 


















* J 




3- 


64 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


ttg 


gtc 


cag 


cct 


ggg 


ggg 


tec 


ctg 


aga 


y 
0 








etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


ttc 


agt 


















p. 




J- 


bo 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


ttg 


gtc 


cag 


cct 


ggg 


ggg 


tec 


ctg 


aga 


3 


ZD 






etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


GtC 


agt 


















£3. 




3- 


72 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


ttg 


gtc 


cag 


cct 


ggA ggg 


tec 


ctg 


aga 










etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


ttc 


agt 






















3- 


73 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tct 


ggg 


gga 


ggc 


ttg 


gtc 


cag 


cct 


ggg 


ggg 


tec 


ctg 


aAa 


^ H 
a si 








etc 


tec 


tgt 


gca 


gee 


tct 


ggG 


ttc 


acc 


ttc 


agt 


















J ; 


30 


3- 


74 


gag 


gtg 


cag 


ctg 


gtg 


gag 


tcC 


ggg 


gga 


ggc 


ttA 


gtT 


cag 


cct 


ggg 


ggg 


tec 


ctg 


aga 


M 








etc 


tec 


tgt 


gca 


gee 


tct 


gga 


ttc 


acc 


ttc 


agt 






















3- 


d 


gag 
etc 


gtg 
tec 


cag 
tgt 


ctg 
gca 


gtg 
gee 


gag 
tct 


tct 
gga 


Cgg 
ttc 


gga 
acc 


gTc 
GtC 


ttg 
agt 


gtA 


cag 


cct 


ggg 


ggg 


tec 


ctg 


aga 



35 



40 



VH4 



4-04 


CAG 


GTG 


CAG CTG 


CAG 


GAG 


TCG 


GGC 


CCA 


GGA 


CTG 


GTG 


AAG 


CCT 


TCG GGG 


ACC 


CTG 


TCC 




CTC 


ACC 


TGC GCT 


GTC 


TCT 


GGT 


GGC 


TCC 


ATC 


AGC 
















4-28 


cag 


gtg 


cag ctg 


cag 


gag 


teg 


ggc 


cca 


gga 


ctg 


gtg 


aag 


cct 


teg gAC 


acc 


ctg 


tec 




etc 


acc 


tgc get 


gtc 


tct 


ggt 


TAc 


tec 


ate 


age 
















4-30. 


1 cag 


gtg 


cag ctg 


cag 


gag 


teg 


ggc 


cca 


gga 


ctg 


gtg 


aag 


cct 


tcA CAg 


acc 


ctg 


tec 




etc 


acc 


tgc Act 


gtc 


tct 


ggt 


ggc 


tec 


ate 


age 
















4-30. 


2 cag 


Ctg 


cag ctg 


cag 


gag 


tcC 


ggc 


Tea 


gga 


ctg 


gtg 


aag 


cct 


tcA CAg 


acc 


ctg 


tec 




etc 


acc 


tgc get 


gtc 


tct 


ggt 


ggc 


tec 


ate 


age 
















4-30. 


4 cag 


gtg 


cag ctg 


cag 


gag 


teg 


ggc 


cca 


gga 


ctg 


gtg 


aag 


cct 


tcA CAg 


acc 


ctg 


tec 




etc 


acc 


tgc Act 


gtc 


tct 


ggt 


ggc 


tec 


ate 


age 

















4-31 


cag 


gtg 


cag 


ctg 


cag 


gag 


teg 


ggc 


cca 


gga 


ctg 


gtg 


aag 


cct 


tcA CAg 


acc 


ctg 


tec 




etc 


ace 


tgc 


Act 


gtc 


tct 


ggt 


ggc 


tec 


ate 


age 


















4-34 


cag 


gtg 


cag 


ctA 


cag 


Cag 


tGg 


ggc 


Gca 


gga 


ctg 


Ttg 


aag 


cct 


teg 


gAg 


acc 


ctg 


tec 




etc 


ace 


tgc 


get 


gtc 


tAt 


ggt 


ggG 


tec 


Ttc 


agT 


















4-39 


cag 


Ctg 


cag 


ctg 


cag 


gag 


teg 


ggc 


cca 


gga 


ctg 


gtg 


aag 


cct 


teg 


gAg 


acc 


ctg 


tec 




etc 


ace 


tgc 


Act 


gtc 


tct 


ggt 


ggc 


tec 


ate 


age 


















4-59 


cag 


gtg 


cag 


ctg 


cag 


gag 


teg 


ggc 


cca 


gga 


ctg 


gtg 


aag 


cct 


teg 


gAg 


acc 


ctg 


tec 




etc 


acc 


tgc 


Act 


gtc 


tct 


ggt 


ggc 


tec 


ate 


agT 












- 






4-61 


cag 


gtg 


cag 


ctg 


cag 


gag 


teg 


ggc 


cca 


gga 


ctg 


gtg 


aag 


cct 


teg 


gAg 


acc 


ctg 


tec 




etc 


acc 


tgc 


Act 


gtc 


tct 


ggt 


ggc 


tec 


Gtc 


age 


















4-b 


cag 


gtg 


cag 


ctg 


cag 


gag 


teg 


ggc 


cca 


gga 


ctg 


gtg 


aag 


cct 


teg 


gAg 


acc 


ctg 


tec 




etc 


acc 


tgc 


get 


gtc 


tct 


ggt 


TAc 


tec 


ate 


age 


















VH5 








































5-51 


GAG 


GTG 


CAG 


CTG 


GTG 


CAG 


TCT 


GGA 


GCA 


GAG 


GTG 


AAA 


AAG 


CCC 


GGG 


GAG 


TCT 


CTG 


AAG 




ATC 


TCC 


TGT 


AAG 


GGT 


TCT 


GGA 


TAC 


AGC 


TTT 


ACC 


















5-a 


gaA 


gtg 


cag 


ctg 


gtg 


cag 


tct 


gga 


gca 


gag 


gtg 


aaa 


aag 


ccc 


ggg 


gag 


tct 


ctg 


aGg 




ate 


tec 


tgt 


aag 


ggt 


tct 


gga 


tac 


age 


ttt 


acc 


















VH6 








































D 1 




G1A 


par* 




far 


par 




GG1 


rr r 


rrn 


CTG 


GTG 


AAG 


CCC 


TCG 


CAG 


ACC 


CTC 


TCA 




CTC 


ACC 


TGT 


GCC 


ATC 


TCC 


GGG 


GAC 


AGT 


GTC 


TCT 


















VH7 








































7-4.1 


CAG 


GTG 


CAG 


CTG 


GTG 


CAA 


TCT 


GGG 


TCT 


GAG 


TTG 


AAG 


AAG 


CCT 


GGG 


GCC 


TCA 


GTG 


AAG 




GTT 


TCC 


TGC 


AAG 


GCT 


TCT 


GGA 


TAC 


ACC 


TTC 


ACT 



















Table 220: RERS sites in Human HC GLG FRls where there are at least 20 GLGs cut 



10 



15 



Bsgl 


GTGCAG 










71 


(cuts 


16/14 


bases 


1: 


4 


1: 


13 


2: 


13 


3: 


4 


3: 


13 


4: 13 


6: 


13 


7: 


4 


7: 


13 


8: 


13 


9: 


4 


9: 13 


10: 


4 


10: 


13 


15: 


4 


15: 


65 


16: 


4 


16: 65 


17: 


4 


17: 


65 


18: 


4 


18: 


65 


19: 


4 


19: 65 


20: 


4 


20: 


65 


21: 


4 


21: 


65 


22: 


4 


22: 65 


23: 


4 


23: 


65 


24: 


4 


24: 


65 


25: 


4 


25: 65 


26 : 


4 


26: 


\j -j 


?7 • 








2ft • 


4 
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There are 34 hits at base# 16 
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There are 21 hits at base# 77 
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There are 34 hits at base# 16 
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75 33: 56 35: 56 36: 56 

There are 21 hits at base# 56 
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?=l There are 34 hits at base# 16 

-"- gactc 21 

15: 56 16: 56 17: 56 18: 56 19: 56 20: 56 

21: 56 22: 56 23: 56 24: 56 25: 56 26: 56 
30 27: 56 28: 56 29: 56 30: 56 31: 56 32: 56 

33: 56 35: 56 36: 56 

There are 21 hits at base# 56 
AlwNI CAGNNNctg 26 

15: 68 16: 68 17: 68 18: 68 19: 68 20: 68 
35 21: 68 22: 68 23: 68 24: 68 25: 68 26: 68 

27: 68 28: 68 29: 68 30: 68 31: 68 32: 68 

33: 68 34: 68 35: 68 36: 68 39: 46 40: 46 

41: 46 42: 46 

There are 22 hits at base# 68 
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Seqs with both expected and unexpected.... 8 
Seqs with no sites 0 



Q 

CG 

iy 



Analysis repeated using only 8 best REdaptors 

5 Id Ntot 01234567.8 + 

1 301 78 101 54 32 16 9 10 1 0 281 102#1 ccgtgtattactgtgcgagaga 

2 493 69 155 125 73 37 14 11 3 6 459 103#2 ctgtgtattactgtgcgagaga 

3 189 52 45 38 23 18 5 4 1 3 176 108#3 ccgtgtattactgtgcgagagg 

4 127 29 23 28 24 10 6 5 2 0 114 323#22 ccgtatattactgtgcgaaaga 
JO 5 78 21 25 14 11 1 4 2 0 0 72 330#23 ctgtgtattactgtgcgaaaga 

6 79 15 17 25 8 11 1 2 0 0 76 439#44 ctgtgtattactgtgcgagaca 

7 43 14 15 5 5 3 0 1 0 0 42 551#48 ccatgtattactgtgcgagaca 

8 307 26 63 72 51 38 24 14 13 6 250 5a#49 ccatgtattactgtgcgaga 
1 102#1 ccgtgtattactgtgcgagaga ccgtgtattactgtgcgagaga 

15 2 103#2 ctgtgtattactgtgcgagaga .t 

3 108#3 ccgtgtattactgtgcgagagg g 

4 323#22 ccgtatattactgtgcgaaaga ....a a... 

5 330#23 ctgtgtattactgtgcgaaaga . t a... 

6 439#44 ctgtgtattactgtgcgagaca .t c. 

20 1 551#48 ccatgtattactgtgcgagaca ..a c. 

8 5a#49 ccatgtattactgtgcgagaAA ..a AA 

Seqs with the expected RE site only 1463 / 1617 

Seqs with only an unexpected site 0 

25 Seqs with both expected and unexpected. ... 7 
Seqs with no sites 0 
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m 


V0 


r- 
















[0 


u 




a > 


I 


1 




rH 


CM 


CM 


CM 


CM 


CM 


CM 


CM 


CM 




1 


i 


1 


1 


i 


i 
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X 
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in 


V£ 




















X 
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rH 


CM 










CM 


CM 




















rH 
































rH 


rH 


rH 


rH 


rH 


rH 






A 












































X 


1 






























00 


a\ 


O 


rH 


CM 
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X 
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CM 
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CO 


CO 


ro 


ro 


M 


o 
































O 


O 




o 


O 


o 




CM 


A 






























CM 


CM 




CM 


CM 


CM 


E 


X 


1 






























00 


a\ 




rH 


CM 


CO 


CQ 


X 


1 




1 


1 








1 


1 


1 


1 






i 




CM 


CM 




CO 


CO 


ro 




co 












CO 


CO 






ro 


CO 


ro 


CO 






CO 


CO 












*r 












*r 
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^ 


M 


X 
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o 






ro 




in 








00 




o 


rH 


CM 


ro 
















r- 1 


CM 






CM 


CM 


CM 


CM 






CM 


CM 


ro 


CO 


CO 
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to 


o 
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x 












































M 
















































CM 
































CM 












CQ 


CM 
































CM 












CQ 


X 
































00 












PQ 


x 






1 


| 








| 


1 










i 




CM 














ro 
































ro 


CO 


ro 


CO 


CO 


CO 




<r 






































*T 








X 
































00 


cn 


o 


rH 


CM 


CO 
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CM 


CM 


ro 


ro 


ro 


CO 




CM 












CM 


CM 


CM 


CM 


CM 


CM 


CM 


CM 


CM 






























*r 








■^r 


























X 
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o 


rH 


CM 


CO 




in 




r- 


















X 












rH 


CM 


CM 


CM 


CM 


CM 


CM 


CM 


CM 
















M 
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CM 












































CQ 


X 














































X 






1 
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<Xt 




cn 


a\ 


a\ 


<T\ 


a\ 


a\ 


cn 


cn 






a> 


o\ 


cn 


G\ 


as 


o\ 
















vo 


VD 


V£> 


VD 


VD 




vo 




\s> 




KD 




vo 


VD 


\D 












r- 


03 




a\ 


o 


rH 


CM 


ro 




m 


KD 






00 
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o 


rH 


CM 


ro 










rH 


rH 




rH 


CM 


CM 


CM 


CM 


CM 


CM 


CM 


CM 




CM 

t 


CM 


CO 


CO 


CO 

i 


ro 










1 

rH 


1 

rH 




1 

rH 


1 

rH 


1 

rH 


rH 


i 

rH 


rH 


1 

rH 


I 

rH 


1 

rH 




rH 


t 

rH 


i 

rH 


1 

rH 


rH 


rH 










O 


O 




O 


O 


O 


O 


O 


O 


O 


O 
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o 


O 


o 


O 


O 


O 










r- 
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rH 
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m 
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<Ti 


o 


rH 
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CO 










rH 


rH 
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CM 


CM 


CM 


CM 


CM 


CM 


CM 




CM 
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CO 


ro 


CO 
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ro 
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V© 
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rH 
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[■'£ 
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Table 400 Lambda FR1 GLG sequences 
! VL1 



10 



f 15 



m 20 



u 25 



30 



! VL2 



! VL3 



CAG 


TCT 


GTG 


CTG 


ACT 


CAG 


CCA 


CCC 


TCG 


GTG 


TCT GAA 


GCC 


CCC 


AGG 


CAG 


AGG 


GTC 


ACC 


ATC 


TCC 


TGT 


! la 


cag 


tct 


gtg 


ctg 


acG 


cag 


ccG 


ccc 


tcA 


gtg 


tct gGG 


gcc 


ccA 


Ggg 


cag 


agg 


gtc 


acc 


ate 


tec 


tgC 


! le 


cag 


tct 


gtg 


ctg 


act 


cag 


cca 


ccc 


tcA 


gCg 


tct gGG 


Acc 


ccc 


Ggg 


cag 


agg 


gtc 


acc 


ate 


tcT 


tgt 


! lc 


cag 


tct 


gtg 


ctg 


act 


cag 


cca 


ccc 


tcA 


gCg 


tct gGG 


Acc 


ccc 


Ggg 


cag 


agg 


gtc 


acc 


ate 


tcT 


tgt 


! lg 


cag 


tct 


gtg 


Ttg 


acG 


cag 


ccG 


ccc 


tcA 


gtg 


tct gCG 


gcc 


CCA 


GgA 


cag 


aAg 


gtc 


acc 


ate 


tec 


tgC 


! lb 


CAG 


TCT 


GCC 


CTG 


ACT 


CAG 


CCT 


CCC 


TCC 


GCG 


TCC GGG 


TCT 


CCT 


GGA 


CAG 


TCA 


GTC 


ACC 


ATC 


TCC 


TGC 


! 2c 


cag 


tct 


gcc 


ctg 


act 


cag 


cct 


cGc 


tcA 


gTg 


tec ggg 


tct 


cct 


gga 


cag 


tea 


gtc 


acc 


ate 


tec 


tgc! 


2e 


caq 


tct 


qcc 


ctq 


act 


cag 


cct 


Gcc 


tec 


qTq 


tcT qqq 


tct 


cct 


gga 


cag 


tcG 


Ate 


acc 


ate 


tec 


tgc 


! 2a2 


cag 


tct 


gcc 


ctg 


act 


cag 


cct 


ccc 


tec 


gTg 


tec ggg 


tct 


cct 


gga 


cag 


tea 


gtc 


acc 


ate 


tec 


tgc 


! 2d 


cag 


tct 


gcc 


ctg 


act 


cag 


cct 


Gcc 


tec 


gTg 


tcT ggg 


tct 


cct 


gga 


cag 


tcG 


Ate 


acc 


ate 


tec 


tgc 


! 2b2 


TCC 


TAT 


GAG 


CTG 


ACT 


CAG 


CCA 


CCC 


TCA 


GTG 


TCC GTG 


TCC 


CCA 


GGA 


CAG 


ACA 


GCC 


AGC 


ATC 


ACC 


TGC! 


3r 


tec 


tat 


gag 


ctg 


act 


cag 


cca 


cTc 


tea 


gtg 


tcA gtg 


Gcc 


cTG 


gga 


cag 


acG 


gcc 


agG 


atT 


acc 


tgT 


! 3j 


tec 


tat 


gag 


ctg 


acA 


cag 


cca 


ccc 


tcG 


gtg 


tcA gtg 


tec 


cca 


gga 


caA 


acG 


gcc 


agG 


ate 


acc 


tgc! 


3p 


tec 


tat 


gag 


ctg 


acA 


cag 


cca 


ccc 


tcG 


gtg 


tcA gtg 


tec 


cTa 


gga 


cag 


aTG 


gcc 


agG 


ate 


acc 


tgc 


! 3a 


tcT 


tct 


gag 


ctg 


act 


cag 


GAC 


ccT 


GcT 


gtg 


tcT gtg 


Gcc 


TTG 


gga 


cag 


aca 


gTc 


agG 


ate 


acA 


tgc 


! 31 




tec 


tat 


gTg 


ctg 


act 


cag 


cca 


ccc 


tea 


gtg 


tcA gtg 


Gcc 


cca 


gga 


Aag 


acG 


gcc 


agG 


atT 


ace 


tgT 


! 3h 


tec 


tat 


gag 


ctg 


acA 


cag 


cTa 


ccc 


tcG 


gtg 


tcA gtg 


tec 


cca 


gga 


cag 


aca 


gcc 


agG 


ate 


acc 


tgc 


! 3e 


tec 


tat 


gag 


ctg 


aTG 


cag 


cca 


ccc 


tcG 


gtg 


tcA gtg 


tec 


cca 


gga 


cag 


acG 


gcc 


agG 


ate 


acc 


tgc 


! 3m 


tec 


tat 


gag 


ctg 


acA 


cag 


cca 


Tec 


tea 


gtg 


tcA gtg 


tcT 


ccG 


gga 


cag 


aca 


gcc 


agG 


ate 


acc 


tgc 


! V2-19 



! VL4 

10 CTG CCT GTG CTG ACT CAG CCC CCG TCT GCA TCT GCC 

TTG CTG GGA GCC TCG ATC AAG CTC ACC TGC ! 4c 
cAg cct gtg ctg act caA TcA TcC tct gcC tct gcT 
tCC ctg gga Tec teg Gtc aag etc acc tgc ! 4a 
cAg cTt gtg ctg act caA TcG ccC tct gcC tct gcc 
tCC ctg gga gcc teg Gtc aag etc acc tgc ! 4b 

! VL5 

CAG CCT GTG CTG ACT CAG CCA CCT TCC TCC TCC GCA 
TCT CCT GGA GAA TCC GCC AGA CTC ACC TGC ! 5e 
cag Get gtg ctg act cag ccG Get tec CTc tcT gca 
tct cct gga gCa tcA gcc agT etc acc tgc ! 5c 
cag cct gtg ctg act cag cca Tct tec CAT tcT gca 
tct Tct gga gCa tcA gTc aga etc acc tgc ! 5b 

! VL6 

AAT TTT ATG CTG ACT CAG CCC CAC TCT GTG TCG GAG 
TCT CCG GGG AAG ACG GTA ACC ATC TCC TGC ! 6a 

! VL7 





CAG 


ACT 


GTG 


GTG 


ACT 


CAG 


GAG 


CCC 


TCA 


CTG 


ACT GTG 




TCC 


CCA 


GGA 


GGG 


ACA 


GTC 


ACT 


CTC 


ACC 


TGT 


! 7a 




cag 


Get 


gtg 


gtg 


act 


cag 


gag 


ccc 


tea 


ctg 


act gtg 


30 


tec 


cca 


gga 


ggg 


aca 


gtc 


act 


etc 


acc 


tgt 


! 7b 



! VL8 



m 

Id 



s . 3 



15 



fj. 20 



esq 

H 
fi 



25 



CAG ACT GTG GTG ACC CAG GAG CCA TCG TTC TCA GTG 
TCC CCT GGA GGG ACA GTC ACA CTC ACT TGT ! 8a 



! VL9 

CAG CCT GTG CTG ACT CAG CCA CCT TCT GCA TCA GCC 
TCC CTG GGA GCC TCG GTC ACA CTC ACC TGC ! 9a 

! VL10 

5 CAG GCA GGG CTG ACT CAG CCA CCC TCG GTG TCC AAG 

GGC TTG AGA CAG ACC GCC ACA CTC ACC TGC ! 10a 



C3 

m 

la 

£ 33, 

u 

in- 

~-4 

t-Jl 

- F 



JO 



Table 405 RERSs found in human lambda FR1 GLGs 

! There are 31 lambda GLGs 

Mlyl NnnnnnGACTC 25 



1 
9 
20 
25 
31 



6 
6 
6 
6 
6 



3: 


6 


4 : 


6 


6: 


6 


7: 


6 


8: 


6 


10: 


6 


11: 


6 


12: 


6 


15: 


6 


16: 


6 


21: 


6 


22: 


6 


23: 


6 


23: 


50 


24: 


6 


25: 


50 


26: 


6 


27: 


6 


28: 


6 


30: 


6 



There are 23 hits at base# 6 

-"- GAGTCNNNNNn 
26: 34 



Mwol GCNNNNNnngc 



20 



P 

•a zr 

C5 

i .5 

id 
□ 



LI 



75 



25 



30 



31: 12 
There are 



1: 9 


2: 9 


3: 


9 


4: 


9 


11: 


9 


11: 


56 


12: 9 


13: 9 


14 : 


9 


16: 


9 


17: 


9 


18: 


9 


19: 9 


20: 9 


23: 


9 


24: 


9 


25: 


9 


26: 


9 


30: 9 


31: 9 


















There are 


19 hits 


at 


baseH 


9 












Hinfl Gantc 








27 










1: 12 


3: 12 


4: 


12 


6: 


12 


7: 


12 


8: 


12 


9: 12 


10: 12 


11: 


12 


12: 


12 


15: 


12 


16: 


12 


20: 12 


21: 12 


22: 


12 


23: 


12 


23: 


46 


23: 


56 


24: 12 


25: 12 


25: 


56 


26: 


12 


26: 


34 


27: 


12 


28: 12 


30: 12 


31: 


12 














There are 


23 hits 


at 


basefl 


12 












Plel gactc 








25 










1: 12 


3: 12 


4: 


12 


6: 


12 


7: 


12 


8: 


12 


9: 12 


10: 12 


11: 


12 


12: 


12 


15: 


12 


16: 


12 


20: 12 


21: 12 


22: 


12 


23: 


12 


23: 


56 


24: 


12 


25: 12 


25: 56 


26: 


12 


27: 


12 


28: 


12 


30: 


12 



23 hits at base# 12 



35 gagtc 
26: 34 






Ddel 


Ctnag 










32 












1: 


14 


2: 


24 


3: 


14 


3: 


24 


4: 


14 


4: 


24 




5: 


24 


6: 


14 


7: 


14 


7: 


24 


8: 


14 


9: 


14 


5 


10: 


14 


11: 


14 


11: 


24 


12: 


14 


12: 


24 


15: 


5 




15: 


14 


16: 


14 


16: 


24 


19: 


24 


20: 


14 


23: 


14 




24: 


14 


25: 


14 


26: 


14 


27: 


14 


28: 


14 


29: 


30 




30: 


14 


31: 


14 




















There < 


are 21 hits 


: at 


base# 14 












10 




























BsaJI Ccnngg 










38 












1: 


23 


1: 


40 


2: 


39 


2: 


40 


3: 


39 


3: 


40 




4: 


39 


4: 


40 


5: 


39 


11: 


39 


12: 


38 


12: 


39 




13: 


23 


13: 


39 


14: 


23 


14: 


39 


15: 


38 


16: 


39 


15 


17: 


23 


17: 


39 


18: 


23 


18: 


39 


21: 


38 


21: 


39 


□ 


21: 


47 


22: 


38 


22: 


39 


22: 


47 


26: 


40 


27: 


39 




28: 


39 


29: 


14 


29: 


39 


30: 


38 


30: 


39 


30: 


47 


U 


31: 


23 


31: 


32 



















IJ There are 17 hits at base# 39 

IZ 20 There are 5 hits at base# 38 

m There are 5 hits at base# 40 Makes cleavage ragged. 

§ ^ . 



Mnll cctc 35 



1: 


23 


2: 


23 


3: 


23 


4 : 


23 


5: 


23 


6: 


19 


6: 


23 


7 : 


19 


8: 


23 


9: 


19 


9: 


23 


10: 


23 


11: 


23 


13: 


23 


14: 


23 


16: 


23 


17: 


23 


18: 


23 


19: 


23 


20: 


47 


21: 


23 


21: 


29 


21: 


47 


22: 


23 


22: 


29 


22: 


35 


22: 


47 


23: 


26 


23: 


29 


24: 


27 


27: 


23 


28: 


23 


30: 


35 


30: 


47 


31: 


23 







There are 21 hits at base# 23 

30 There are 3 hits at base# 19 

There are 3 hits at base# 29 

There are 1 hits at base* 26 

There are 1 hits at base# 27 These could make cleavage ragged. 

-"- gagg 7 

35 1: 48 2: 48 3: 48 4: 48 27: 44 28: 44 



29: 44 



BssKI Nccngg 



39 



C3 

y 

3 : 5 



70 



75 



20 



1: 


40 


2: 


39 


3: 


39 


3: 


40 


4 : 


39 


4 : 


40 


5: 


39 


6: 


31 


6: 


39 


7: 


31 


7: 


39 


8: 


39 


9: 


31 


9: 


39 


10: 


39 


11: 


39 


12: 


38 


12: 


52 


13: 


39 


13: 


52 


14: 


52 


16: 


39 


16: 


52 


17: 


39 


17: 


52 


18: 


39 


18: 


52 


19: 


39 


19: 


52 


21: 


38 


22: 


38 


23: 


39 


24: 


39 


26: 


39 


27: 


39 


28: 


39 


29: 


14 


29: 


39 


30: 


38 















There are 21 hits at base# 39 

There are 4 hits at base# 38 

There are 3 hits at base# 31 

There are 3 hits at base# 40 



Ragged 



BstNI CCwgg 
1:41 2 
9: 40 
13: 53 
18: 40 
24: 40 



10: 
14: 
18: 
27: 



40 
53 
40 
39 



U 25 



30 

40 5: 40 6: 40 7: 
40 11: 40 12: 39 12: 
53 16: 40 16: 53 17: 
53 19: 53 21: 39 22; 
40 28: 40 29: 15 29: 40 
There are 17 hits at base# 40 
There are 7 hits at base# 53 
There are 4 hits at base# 39 
There are 1 hits at base# 41 Ragged 



8: 40 

13: 40 

17: 53 

23: 40 

30: 39 



PspGI ccwgg 30 



1: 


41 


2: 


40 


5: 


40 


6: 


40 


7: 


40 


8: 


40 


9: 


40 


10: 


40 


11: 


40 


12: 


39 


12: 


53 


13: 


40 


13: 


53 


14: 


53 


16: 


40 


16: 


53 


17: 


40 


17: 


53 


18: 


40 


18: 


53 


19: 


53 


21: 


39 


22: 


39 


23: 


40 


24: 


40 


27: 


40 


28: 


40 


29: 


15 


29: 


40 


30: 


39 



There are 17 hits at base# 40 
There are 7 hits at base# 53 
35 There are 4 hits at base# 39 




There are 1 hits at base# 41 



ScrFI CCngg 39 



1: 41 


2: 40 


3: 


40 


3: 


41 


4: 


40 


4: 


41 


5: 40 


6: 32 


6: 


40 


7: 


32 


7: 


40 


8: 


40 


9: 32 


9: 40 


10: 


40 


11: 


40 


12: 


39 


12: 


53 


13: 40 


13: 53 


14: 


53 


16: 


40 


16: 


53 


17: 


40 


17: 53 


18: 40 


18: 


53 


19: 


40 


19: 


53 


21: 


39 


22: 39 


23: 40 


24: 


40 


26: 


40 


27: 


40 


28: 


40 


29: 15 


29: 40 


30: 


39 














There are 


21 hits 


at 


base# 


40 












There are 


4 hits 


at 


base# 


39 












There are 


3 hits 


at 


base# 


41 













0 


75 


Maelll 


gtnac 








16 










in 




1: 52 


2: 52 


3: 


52 


4 : 


52 


5: 


52 


6: 


52 


: n 

i ; ~ 

•a - ' 




7: 52 


9: 52 


26: 


52 


27: 


10 


27: 


52 


28: 


10 


3 .= 




28: 52 


29: 10 


29: 


52 


30: 


52 










e ■ - 


20 


There 


are 13 hits 


at 


base# 


52 
















Tsp45l 


gtsac 








15 










as 




1: 52 


2: 52 


3: 


52 


4 : 


52 


5: 


52 


6: 


52 


r™" 

Q 




7: 52 
29: 10 


9: 52 
29: 52 


27: 
30: 


10 
52 


27: 


52 


28: 


10 


28: 


52 


3 s 
a: sa 


25 


There 


are 12 hits 


at 


base# 


52 













HphI tcacc 2 6 



1: 


53 


2: 


53 


3: 


53 


4 : 


53 


5: 


53 


6: 


53 


7: 


53 


8: 


53 


9: 


53 


10: 


53 


11: 


59 


13: 


59 


14: 


59 


17: 


59 


18: 


59 


19: 


59 


20: 


59 


21: 


59 


22: 


59 


23: 


59 


24: 


59 


25: 


59 


27: 


59 


28: 


59 


30: 


59 


31: 


59 



















There are 16 hits at base# 59 
There are 10 hits at base# 53 



35 



BspMI ACCTGCNNNNn 14 
11: 61 13: 61 14: 61 17: 61 18: 61 19 
20: 61 21: 61 22: 61 23: 61 24: 61 25 
30: 61 31: 61 
5 There are 14 hits at base# 61 Goes into CDR1 




Table 500: h3401-h2 captured Via CJ with BsmAI 

! 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 
1SAQDIQMTQSPATLS 
a GT GCA C aa gac ate cag atg acc cag tct cca gec acc ctg tct 
5 ! ApaLI ... a gec acc ! 

L25,L6 ,L20,L2,L16,A11 

! Extender Bridge. . . 

! 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 
70 ! V S PGERATLS CRAS Q 
gtg tct cca ggg gaa agg gec acc etc tec tgc agg gec agt cag 







! 31 


32 


33 


34 


35 


36 


37 


38 


39 


40 


41 


42 


43 


44 


45 






! S 


V 


S 


N 


N 


L 


A 


W 


Y 


Q 


Q 


K 


P 


G 


Q 




75 


agt 


gtt 


agt 


aac 


aac 


tta 


gec 


tgg 


tac 


cag 
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What happens in the top strand: 



i 



(VL133-2a2*) 5 f -g tct cct g | ga cag teg ate 



1 



(VL133-31*) 5'-g gee ttg g I ga cag aca gtc 



(VL133-2c*) 5'-g tct cct g | ga cag tea gtc 



I 

10 (VL133-lc*) 5 f -g gec cca glgg cag agg gtc 

! 

! The following Extenders and Bridges all encode the AA sequence of 2a2 for 
codons 1-15 

! 1 

15 (ON__LamExl33) 5 1 -ccTcTgAcTgAgT gcA cAg - 
I 

! 2 3 4 5 6 7 8 9 10 11 12 

AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 

I 

20 ! 13 14 15 

tcC ccG g ! 2a2 
! 1 
(ON_LamBl-133) [RC] 5 1 - CcTcTgAcTgAgT gcA cAg - 

£3 i 



25 I 2 3 4 5 6 7 8 9 10 11 12 

AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 



"S -BP 

3:5- I 

^ ! 13 14 15 

i 

(ON_LamB2-133) [RC] 5 1 -CcTcTgAcTgAgT gcA cAg - 
I 

35 I 2 3 4 5 6 7 8 9 10 11 12 

AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 



I 

! 13 14 15 



tcC ccG g ga cag aca gt-3 1 ! 31 !;.;i»'.J^ 

"i * ' " " " * " "* v " ■ " aw ' ' '** ■ ' wa ■ v " *" : " ::v ■ * *' : 

i 

(ON_LamB3-133) [RC] 5 1 -ccTcTgAcTgAgT gcA cAg - 

45 ! 

! 2 3 4 5 6 7 8 9 10 11 12 

AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 

I 

! 13 14 15 

50 ^.9.9....9. cG ....9..„9 a . ca i tca 9 t "3 1 ! 2c |;#;,iyi't^ 

C^^ 1 ?^^^^^.^^^^!^.^^^ 5 © 




.■r.lyl.y.- 
||| 

m 

(ON_LamB4-133) [RC] 5 1 -CcTcTgAcTgAgT gcA cAg - 

55 ! 



i 



I 
I 

i 



2 3 4 5 6 7 8 9 10 11 12 
AGt gcT TtA acC caA ccG gcT AGT gtT AGC ggT- 



13 14 15 
tcC ccG g gg cag agg gt-3' 



(ON_Laml33PCR) 5 1 -ccTcTgAcTgAgT gcA cAg AGt gc-3 1 
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630: Oligonucleotides used to clone CDR1/2 diversity 

equences are 5 T to 3'. 

N_CDlBsp, 30 bases 

cTcAcTgg cTT c cggA 
3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 

cAcT.TTcTcT 
21 22 23 24 25 26 27 28 29 30 

N_Brl2, 42 bases 

AAAcccAcTccAAAcc 
3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 

TAccAggAg cTTggcg 
21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 

c c c A 
39 40 41 42 

N_CD2Xba, 51 bases 

AAgg cAgTgAT cTAgA 
3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 

TAgTgAAgcgAccTT T. 
21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 

cggAgTcAgcATA 
39 40 41 42 43 44 45 46 47 48 49 50 51 

NT BotXba, 23 bases 
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